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Description 

The present invention relates in general to methods for generating 
and detecting recombinant DNA sequences in Saccharomyces cere- 
visiae and plasmids and S. cerevisiae cells used for conducting the 
5 inventive methods. 

DNA sequences for which these methods are relevant include pro- 
tein-encoding and non-coding sequences; they may also consist of 
larger continuous stretches that contain more than a single coding 
sequence with intervening non-coding sequences, such as those 
10 that as may belong to a biosynthetic pathway. 

The microbial and enzymatic production of substances such as en- 
zymes and other proteins is an important economical topic. Enzymes 
are biocatalytically active proteins not only responsible for the me- 
tabolism of natural compounds and organisms, but also utilized for 

15 the industrial production of natural and non-natural compounds. En- 
zymes or those compounds produced by the help of enzymes can be 
used for the production of drugs, cosmetics, foodstuffs, etc. How- 
ever, the industrial use of enzymes has been greatly hindered by 
their target specifity and the specific conditions under which they can 

20 function. Other proteins have therapeutic applications in the fields of 
human and animal health. Important classes of medically important 
proteins include cytokines and growth factors. 

Proteins, enzymes, and pathways with novel or improved functions 
and properties can be obtained either by searching among largely 
25 unknown natural species or by improving upon currently known natu- 
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ral proteins or enzymes. The latter approach may be more suitable 
for creating properties for which natural evolutionary processes are 
unlikely to have been selected. 

One promising strategy to create such novel desirable properties 
5 and to redesign enzymes, other proteins, non-coding sequences or 
pathways is by directed molecular evolution. Conventionally, as di- 
rect evolution of DNA sequences has been achieved with such tech- 
niques as site-directed mutagenesis, multi-site or cassette 
mutagenesis, random mutagenesis, and error prone PCR Recently, 
10 gene shuffling approaches to optimize or fine-tune the properties of 
enzymes or proteins have attracted much attention. These directed 
evolutionary techniques can produce enzymes that can improve ex- 
isting technology, produce novel products and expand the capabili- 
ties of synthetic chemistry. 

15 A number of different mutagenesis methods exist, such as random 
mutagenesis, site-directed mutagenesis, oligonucleotide cassette 
mutagenesis, or point mutagenesis by error-prone PCR. Random 
mutagenesis, for example, entails the generation of a large number 
of randomly distributed, nucleotide substitution mutations in cloned 

20 DNA fragments by treatment with chemicals such as nitrous acid, 
hydrazine, etc. Error-prone PCR has been developed to introduce 
random point mutations into cloned genes. Modifications that de- 
crease the fidelity of the PCR reaction include increasing the con- 
centration of MgCI 2 , adding MnCI^, or altering the relative concentra- 

25 tions of the four dNTPs. 

These traditional mutagenesis methods focus on the optimization of 
individual genes having discrete and selectable phenotypes. The 
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general strategy is to clone a gene, identify a discrete function for 
the gene, establish an assay by which it can be monitored, mutate 
selected positions in the gene and select variants of the gene for 
improvement in the known function of the gene. A variant having im- 
5 proved function can then be expressed in a desired cell type. 
Repetitive cycles of mutagenesis methods can be carried out to ob- 
tain desirable enzyme properties. 

Each of these conventional approaches has an implicit sequence 
search strategy. The strategies employed in the above techniques of 

10 sequence searching are very different. Performing a saturating site- 
directed mutagenesis search involves a process of installing every 
possible permutation at a site of interest. For a protein, this proce- 
dure consists of replacing an amino acid at a site of interest with all 
19 other amino acids and searching the resultant library for improved 

15 mutants. In sequence space terms this means that a very small re- 
gion has been searched very thoroughly. In comparison, cassette- 
mutagenesis inserts a random peptide sequence in a specific region 
of a protein, giving a less thorough sampling of a larger, defined re- 
gion of sequence space. Error-prone PCR involves repeated copying 

20 of a sequence, with the introduction of a low but significant number 
of errors. In this case, a sparse sampling of a less defined region of 
sequence space is achieved. In each of these strategies, the best 
mutant obtained in each round of selection is used to initiate the next 
round. 

25 However, traditional mutagenesis approaches for evolving new prop- 
erties in enzymes have a number of limitations. First, they are only 
applicable to genes or sequences that have been cloned and func- 
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functionally characterized. Second, these approaches are usually 
applicable only to genes that have a discrete function. Therefore, 
multiple genes that cooperatively confer a single phenotype usually 
cannot be optimized in this manner. Finally, these approaches can 
5 only explore a very limited number of the total number of permuta- 
tions, even for a single gene. In view of these limitations, conven- 
tional mutagenesis approaches are inadequate for improving cellular 
genomes with respect to many useful properties. For example, 
improvements in the capacity of a cell to express a protein might re- 

10 quire alterations in transcriptional efficiency, translation and post- 
translational modifications, secretion or proteolytic degradation of a 
gene product. It therefore might be necessary to modify additional 
genes having a role in one or more of these cellular mechanisms in 
order to express a protein with new properties. Attempting to indh 

15 vidually optimize all of the genes having such function would be a 
virtually impossible task. 

Most of the problems associated with conventional mutagenesis ap- 
proaches can be overcome by gene shuffling approaches. Gene 
shuffling entails randomly recombining different sequences of func- 

20 tional genes, enabling the molecular mixing of naturally similar or 
randomly mutated genes. DNA or gene shuffling, or variations of 
these techniques, have been used to improve the activity, stability, 
folding, and substrate recognition properties of enzymes. In com- 
parison to conventional mutagenesis approaches with gene shuf- 

25 fling, the probability of obtaining mutants with improved phenotype is 
significantly higher. Gene shuffling is fundamentally different from 
conventional strategies in that it recombines favorable mutations in a 
combinatorial fashion. It therefore will search much larger regions of 
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sequence space much more sparsely and with a bias towards pro- 
ducing functional sequences. It also allows more beneficial muta- 
tions from each round of selection to be retained in the next round 
because it allows sequence information to be contributed from more 
5 than one source. Whereas conventional strategies also allow for the 
fixation of negative mutations, this is not the case for gene shuffling 
approaches. Therefore, it is not surprising that gene shuffling strate- 
gies have yielded much more dramatic results. 

DNA or gene shuffling approaches are based on recombination 

10 events between regions with a certain homology or between 
stretches of identity. A key organism used in experiments to examine 
genetic recombination in eukaryotes has been the budding yeast 
Saccharomyces cerevisiae. The study of these processes in a sim- 
ple, unicellular organism has the obvious advantage of the ease of 

1 5 manipulation of DNA sequences and the possibility of studying spe- 
cific recombination events induced synchronously in a large propor- 
tion of cells. Furthermore, over the last few decades a wealth of ex- 
pertise has been accumulated both in the fermentation technology 
and the basic genetics of this organism, which is at present the best 

20 studied eukaryote at the molecular level. Due to its non-pathogenic 
character, its secretion proficiency and its glycosylate potential, S. 
cerevisiae is a preferred host organism for gene cloning and gene 
expression. Therefore, the technical problem underlying the present 
invention is to provide methods and means for the generation of re- 

25 combinant mosaic genes in Saccharomyces cerevisiae. 
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The present invention solves this underlying technical problem by 
providing a process for generating and detecting recombinant DNA 
sequences in Saccharomyces cerevisiae comprising the steps of: 

a) generating first diploid S. cerevisiae cells bearing in a defined 
5 locus of their genome a first recombination cassette compris- 
ing a first DNA sequence to be recombined, which is flanked 
by at least a first and a second marker sequence, in an allelic 
position a second recombination cassette comprising a sec- 
ond DNA sequence to be recombined, which is flanked by at 

1 0 least a third and a fourth marker sequence, 

b) inducing the sporulation of the first diploid cells obtained in a) 
and 

c) isolating haploid cells containing recombination cassettes in 
which first recombined DNA sequences are flanked by the 

15 first and fourth marker sequences, and haploid cells contain- 

ing recombination cassettes in which second recombined 
DNA sequences are flanked by the second and the third 
marker sequences. 

The present invention provides a yeast-based system to screen for 
20 recombination events between at least two diverging DNA se- 
quences. The system is based on the sexual reproductive cycle of S. 
cerevisiae, which alternates between a haploid phase and a diploid 
phase. In the first step of the inventive process, diploid S. cereivisiae 
cells are generated, which are heterozygous for these recombination 
25 substrates. The DNA sequences to be recombined are integrated in 
the genome of the diploid S. cerevisiae cells at allelic positions. 
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Each DNA sequence to be recombined is integrated in the form of a 
recombination cassette, which comprises besides this DNA se- 
quence at least two marker sequences that flank the DNA sequence, 
whereby the two recombination cassettes comprise at least four drf- 
5 ferent marker sequences. 

The heterozygous diploid cells thus obtained are then grown under 
conditions which induce the processes of meiosis and spore forma- 
tion. Meiosis is generally characterized by elevated frequencies of 
genetic recombination, which is initiated via the formation and sub- 

10 sequent repair of double-strand breaks (DSBs) induced early in 
meiosis I prophase. Yeast meiotic cells are therefore of particular 
interest because they experience high levels of recombination as a 
result of the genome-wide induction of DSBs. Thus the products of a 
first round of meiosis, which are haploid cells or spores for each 

15 meiosis event four produced by a parental diploid cell, can contain 
recombined DNA sequences due to recombination between the two 
diverged DNA sequences. 

Recombination between the two diverging DNA sequences during 
meiosis can also lead to an exchange of the flanking marker se- 

20 quences. Therefore, the present process allows a rapid and simple 
identification of recombined DNA sequences by the selection of indi- 
vidual cells or molecules in which an exchange of marker sequences 
flanking a recombination substrate has taken place. The recombi- 
nants obtained after the first round of meiosis are therefore charac- 

25 terized in that they contain and/or express at least one marker se- 
quence of the first recombination cassette and at least one marker 
sequence of the second recombination cassette. In particular, re- 



8 



combinant spores can contain the first marker sequence of the first 
recombination cassette and the fourth marker sequence of the sec- 
ond recombination cassette or the second marker sequence of the 
first recombination cassette and the third marker sequence of the 
5 second recombination cassette, whereby both types of recombinant 
spores contain besides this different marker combination also differ- 
ent recombinant DNA sequences. Both types of spores containing 
recombinant sequences can easily be selected and distinguished 
under conditions that permit selection for the new recombinant 
1 0 marker configurations produced by recombination during meiosis. 

The inventive process can be conducted either in wild-type or mis- 
match repair-defective S. cerevisiae cells. The processes by which 
damaged DNA is repaired and the mechanisms of genetic recombi- 
nation are intimately related, and it is known that the mismatch repair 

15 machinery has inhibitory effects on the recombination frequency be- 
tween divergent sequences, i.e. homeologous recombination. Muta- 
tions of the mismatch repair system therefore greatly enhance the 
overall frequency of recombination events in yeast On the other 
hand, it is known that wild-type S. cerevisiae ceils have a mismatch 

20 repair-dependent recombination mechanism, which is based on dis- 
tantly spaced mismatches in two recombination substrates. Depend- 
ing on the DNA sequences to be recombined, either wild-type or 
mismatch repair-defective S. cerevisiae cells can be used to obtain 
recombined sequences. 

25 The inventive process has the advantage that it is iterative, i.e: it 
allows further rounds of recombination. The products of the first 
round of meiosis, i.e. haploid cells of opposite mating types which 
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comprise different recombined DNA sequences, are mated again to 
obtain diploid cells which are heterozygous for recombined DNA se- 
quences. In the diploid cells thus obtained meiosis is again induced, 
whereby the recombined DNA sequences are once again recom- 

5 bined, leading again to an exchange of the two markers flanking 
each recombination substrate. The new haploid recombinants ob- 
tained after the second meiosis can now be easily identified by the 
joint expression of either those marker genes which flanked the first 
DNA sequence in the original first recombination cassette or those 

10 marker genes which flanked the second DNA sequence in the origi- 
nal second recombination cassette. 

In a preferred embodiment of the invention therefore haploid cells 
containing a recombination cassette with the first recombined DNA 
sequences obtained in the first round of the inventive process are 

15 mated with haploid cells containing recombination cassettes with the 
second recombined DNA sequences obtained in the first round of 
the inventive process in order to generate second, diploid cells. In 
the thus obtained second diploid cell sporulation is induced, result- 
ing in the generation of haploid cells. In the next steps haploid cells 

20 containing recombination cassettes in which third recombined DNA 
sequences are flanked by at least the first and second marker se- 
quences and haploid cells containing recombination cassettes in 
which fourth recombined DNA sequences are flanked by at least the 
third and fourth marker sequences are isolated. 

25 Further recombined DNA sequences can be generated by subjecting 
the haploid cells containing third and fourth recombined DNA se- 
quences to one or more further cycles of mating and meio- 



10 



sis/sporulation. After each round of recombination, recombinants are 
either identified by the joint presence of at least one marker se- 
quence that flanked the first recombination substrate and at least 
one marker sequence that flanked the second recombination sub- 
5 strate or by the joint presence of the two markers that flanked the 
first or the second DNA sequence in the starting recombination sub- 
strates. 

Therefore, an advantageous feature of the present process is that it 
is iterative: recombinant haploid progeny is selected individually or 

10 en masse and mated to one another, the resulting diploids are 
sporulated anew, and their progeny spores are subjected to appro- 
priate selection conditions to identify new recombination events. 
With the inventive process a large library of recombined, mutated 
sequences can be easily generated, and variants that have acquired 

15 a desired function can then be identified by using an appropriate 
selection or screening system. 

In a preferred embodiment of the invention the first diploid S. cere- 
visiae cell is generated by simultaneously or sequentially transform- 
ing a diploid S. cerevisiae cell with a DNA molecule comprising the 

20 first recombination cassette and a DNA molecule comprising the 
second recombination cassette and optionally allowing the integra- 
tion of the two recombination cassettes into allelic positions on natu- 
ral chromosomes of the S. cerevisiae genome. The DNA molecules 
used can also be for example yeast artificial chromosomes (YAC). 

25 YACs are characterized in that they are linear DNA molecules that 
contain all the sequences necessary for stable maintenance in the 
yeast cell, such as a centromere, DNA replication origin and te- 
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lomeres as well as yeast selectable markers. Upon introduction into 
a yeast cell YACs behave similar to natural chromosomes and there- 
fore can be considered as part of the yeast genome. In the context of 
the present invention the term "genome" includes the whole of all 

5 hereditary components present within a cell, which are stably main- 
tained and inherited. In case YACs are used as DNA molecules for 
introduction the first and second recombination cassettes into diploid 
S. cerevisiae cells it is not necessary to integrate the two recombina- 
tion cassettes into allelic positions in natural chromosomes. In the 

10 case in which recombination cassettes are introduced into natural 
chromosomes, it is possible to use a cloning vehicle, for example a 
plasmid, from which a fragment bearing the recombination cassettes 
can be liberated. Preferably the two respective marker sequences of 
the two recombination cassettes are flanked by targeting sequences 

15 which are homologous to a defined locus of the S. cerevisiae ge- 
nome. Alternatively, a DNA molecule can be used which does not 
contain a replication origin. In this case the DNA molecules must be 
able to integrate into a component of the genome and therefore con- 
tain targeting sequences which are homologous to a defined locus of 

20 the S. cerevisiae genome. 

In another preferred embodiment of the invention the first diploid S. 
cerevisiae cells are generated by fusing haploid cells bearing in a 
locus of their genome the first recombination cassette with S. cere- 
visiae haploid cells bearing in an allelic position of their genome the 
25 second recombination cassette. 

In still another preferred embodiment of the invention the first diploid 
S. cerevisiae cells are generated by mating haploid cells bearing in 
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a locus of their genome the first recombination cassette with S. cere- 
visiae haploid cells of opposite mating type bearing in an allelic posi- 
tion of their genome the second recombination cassette. 

In the context of the present invention the terms "mating 0 and "fus- 
5 ing" denote either the purposeful or the random combination of two 
haploid cells containing different recombination cassettes. A pur- 
poseful mating or fusing of two haploid cells occurs, when two se- 
lected and//or isolated haploid cells of opposite mating type with de- 
sired properties are brought into contact under conditions stimulating 

10 mating and fusing, respectively. The two haploid cells can be de- 
rived from the same library of cells, which for example contain DNA 
sequences to be recombined or already recombined DNA se- 
quences, or from different libraries of cells, which for example con- 
tain DNA sequences to be recombined or already recombined DNA 

15 sequences. 

A random mating or fusing of two haploid cells can occur, when a 
plurality of different haploid cells are brought into contact under con- 
ditions stimulating mating and fusing, respectively. The plurality of 
haploid cells can be derived from the same library of cells, which for 
20 example contain DNA sequences to be recombined or already re- 
combined DNA sequences, or from different libraries of cells, which 
for example contain DNA sequences to be recombined or already 
recombined DNA sequences. 

The inventive process for generating and detecting recombined DNA 
25 sequences has the advantage that more than two diverging se- 
quences can be recombined. If, for example, four diverging DNA se- 
quences shall be recombined, then in the first step of the present 
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process two different sets of diploid S. cerevisiae cells can be gen- 
erated. For example, a first set of diploid cells can be generated by 
mating or fusing haploid cells comprising a first and a second DNA 
sequence to be recombined and a second set of diploid cells can be 
5 generated by mating or fusing haploid cells comprising a third and a 
fourth DNA sequence to be recombined. After sporulation of the two 
sets of diploid cells haploid cells obtained from the first diploid cell 
set that contain recombined DNA sequences due to recombination 
between the first and the second DNA sequence, are mated with 
10 appropriate haploid cells obtained from the second diploid cell set 
that contain recombined DNA sequences due to recombination be- 
tween the third and the fourth DNA sequences. The products of this 
mating are diploid cells which after sporulation give rise to haploid 
cells bearing recombined DNA sequences which comprise regions of 
15 the first DNA sequence, the second DNA sequence, the third DNA 
sequence and the fourth DNA sequence. If, for example, three di- 
verging DNA sequences shall be recombined, in the first step of the 
present process diploid S. cerevisiae cells are generated by, for ex- 
ample, mating or fusing haploid cells comprising a first and a second 
20 DNA sequence to be recombined. After sporulation of these diploid 
cells, the haploid cells thus obtained, which contain recombined 
DNA sequences due to recombination between the first and the sec- 
ond DNA sequences, can be fused or mated with haploid cells com- 
prising a third DNA sequence to be recombined. The products of this 
25 mating are diploid cells which after sporulation give rise to haploid 
cells bearing recombined DNA sequences which comprise regions of 
the first DNA sequence, the second DNA sequence and the third 
DNA sequence. In this way, five, six or more diverging DNA se- 
quences can also be recombined. 
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In a preferred embodiment haploid S. cerevisiae cells bearing the 
first or second recombination cassette are generated by: 

a) inserting the first DNA sequence to be recombined between 
the first and the second marker sequence located adjacently 

5 on a first cloning vehicle and inserting the second DNA se- 

quence to be recombined between the third and the fourth 
marker sequence located adjacently on a second cloning ve- 
hicle, whereby the respective two marker sequences are 
flanked by targeting sequences which are homologous to a 
1 0 defined locus of the S. cerevisiae genome, 

b) excising from the cloning vehicles obtained in a) the first re- 
combination cassette and the second recombination cassette 
with flanking targeting sequences, respectively, whereby each 
excised fragment comprises the DNA sequence to be recom- 

15 bined, which is flanked by the respective two marker 

sequences and by targeting sequences, 

c) transforming the excised fragments, obtained in b) separately 
into S. cerevisiae diploid cells, whereby the targeting se- 
quences direct the integration of the cassettes into that locus 

20 to which they are homologous, in order to obtain diploid cells 

heterozygous for the first cassette, or the second cassette, 

d) inducing separately the sporulation of the heterozygous dip- 
loid cells obtained in c) and 

e) isolating haploid cells containing the first cassette flanked by 
25 the first and second marker sequences and separately hap- 
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loid cells containing the second cassette flanked by the third 
and the fourth marker sequences. 

In a preferred embodiment of the invention the respective two marker 
sequences in the first or second cloning vehicle are flanked by tar- 
5 geting sequences which are homologous to the BUD31-HCM1 locus 
on chromosome III of the S. cerevisiae genome and which direct the 
integration of the excised cassettes into that locus. 

In a preferred embodiment of the invention the cloning vehicle used 
for cloning the recombination cassettes is a plasmid. "Plasmid" 
10 means an extrachromosomal element which can autonomously repli- 
cate. The plasmid is physically unlinked to the genome of the cell 
wherein it is contained. Most plasmids are double-stranded circular 
DNA molecules. In another embodiment the cloning vehicle is an 
YAC. 

15 In particular it is preferred to use as the first cloning vehicle, in which 
the first recombination cassette is cloned, plasmid pMXY9. Plasmid 
pMXY9 comprises the URA3 marker gene and the CAN1 marker 
gene. In this plasmid the two marker genes are adjacently located. 
Between the two marker genes are arranged several restriction sites, 

20 in particular recognition sites for the restriction enzymes Smal, Xba\, 
Bgltt and Pad, for inserting a DNA sequence to be recombined. The 
two marker sequences are flanked by targeting sequences homolo- 
gous to the BUD31-HCM1 locus on chromosome III of the S. cere- 
visiae genome. 

25 Furthermore, it is preferred to use as the second cloning vehicle, in 
which the second recombination cassette is cloned, plasmid 
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pMXY12. Plasmid pMXY12 comprises the TRP1 marker gene and 
the CYH2 marker gene. In this plasmid the two marker genes are 
adjacently located. Between the two genes are arranged several re- 
striction sites, in particular recognition sites for the restriction en- 
5 zymes Spel, Smal and Pad, for inserting a DNA sequence to be re- 
combined. The two marker sequences are flanked by targeting se- 
quences homologous to the BUD31-HCM1 locus on chromosome III 
of the S. cerevisiae genome. 

In a preferred embodiment of the invention the diploid cells used for 
10 transformation of the excised recombination cassette are auxotro- 
phic for at least two nutritional factors and resistant to at least two 
antibiotics. Preferably, the diploid cells are homozygous for the ura3- 
1 allele and the trp1-1 allele, which renders the cells auxotrophic for 
uracil and tryptophan, respectively. Furthermore it is preferred that 
15 the diploid cells used for transformation are homozygous for the 
can 1-1 00 allele and the cyh2R allele, which renders them resistant 
to canavanine and cycloheximide, respectively. 

In particular it is preferred, that diploid cells of the S. cerevisiae 
strain MXY47 are used for transformation, which are homozygous for 

20 the alleles ura3-1 x trp1-1, can1-100 and cyh2R and heterozygous for 
the m$h2::KanMX mutation. When diploid cells of the strain MXY47 
are used for the transformation with the excised first or second frag- 
ments bearing recombination cassettes and their flanking targeting 
sequences, then transformants obtained can be sporulated to yield 

25 haploid wild type or m$h2 segregants that bear the respective re- 
combination cassette. 
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According to the invention it may be preferred to use S. cerevisiae 
cells which have a functional mismatch repair system for the inven- 
tive process. The mismatch repair system belongs to the largest con- 
tributors to avoidance of mutations due to DNA polymerise errors in 

5 replication. Mismatch repair also promotes genetic stability by edit- 
ing the fidelity of genetic recombination. It is known that, therefore, 
the mismatch repair machinery has a somewhat inhibitory effect on 
recombination between diverged sequence. However, in a normal S. 
cerevisiae diploid another aspect of mismatch repair, termed mis- 

10 match repair-dependent recombination, was detected (Borts and 
Haber, Science, 237 (1987), 1459-1466). It is thought that the mis- 
match repair of widely spaced mismatches such as in diverged se- 
quences leads to new double-strand breaks that can in turn stimu- 
late a second round of (mismatch repair-dependent) recombination. 

15 In certain circumstances, in particular, when it is known that the two 
recombination substrates used have widely spaced base differences, 
it is therefore useful to employ S. cerevisiae cells with a functional 
mismatch repair system for conducting the inventive process. 

In another preferred embodiment of the invention, S. cerevisiae cells 
20 that are deficient in the mismatch repair system are used. In S. cere- 
visiae several genes have been identified whose products share 
homology with bacterial mismatch repair proteins, including six 
homologues of the MutS protein, i.e. Msh1, Msh2p, Msh3p, Msn4, 
Msh5 and Msh6p, and four homologues of the MutL protein, i.e. 
25 Wllhlp, Mlh2p, Mlh3p, and Pms1. It is known that in particular the 
PMS1 and MSH2 genes set up a barrier to the recombination of di- 
verged sequences. Therefore, in msh2 and pmsl mutants, meiotic 



18 



recombination between diverged sequences is increased, relative to 
the frequency of recombination in wild type cells. 

In the context of the present invention the term "deficient in the mis- 
match repair system" means that the mismatch repair system (MMR) 
5 of a cell is transiently or permanently impaired. MMR deficiency of a 
cell or an organism can be achieved by any strategy that transiently 
or permanently impairs the mismatch repair including but not limited 
to a mutation of one or more genes involved in mismatch repair, 
treatment with an agent like UV light, which results in a global im- 

10 pairment of MMR, treatment with an agent like 2-aminopurin or a 
heteroduplex containing an excessive amount of mismatches to 
transiently saturate and inactivate the MMR system and inducible 
expression or repression of one or more genes involved in the mis- 
match repair, for example via regulatable promoters, which would 

15 allow for transient inactivation, i.e. during meiosis, but not during 
vegetative growth. 

In a preferred embodiment of the invention the mismatch repair defi- 
ciency of the S. cerevisiae cell is due to a mutation of at least one 
gene involved in the MMR. In a preferred embodiment the S. cere- 
20 visiae cells are deficient in the MSH2 gene. Preferably, diploid cells 
are homozygous for the msh2 allele, in which the MSH2 coding se- 
quences are replaced by the KanMX construct. 

In the context of the present invention the term "recombination cas- 
sette" refers to a DNA sequence comprising at least one recombina- 
25 tion substrate or one DNA sequence to be recombined, which is 
flanked by at least two different marker sequences. The first and the 
second recombination cassette differ in the DNA sequences to be 
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recombined and in the flanking marker sequences, such that any 
pair of recombination cassettes comprises two different DNA se- 
quences to be recombined and at least four different flanking marker 
sequences. 

5 In a preferred embodiment of the invention both the first and the 
second recombination cassettes are generated by inserting the re- 
spective DNA sequences to be recombined between two marker se- 
quences that are closely located on a cloning vehicle and -which in 
turn are surrounded by targeting sequences that are homologous to 

10 a defined locus of the S. cerevisiae genome. The targeting se- 
quences therefore can direct the integration of an excised fragment 
containing a recombination cassette into this defined locus. The in- 
sertion of the DNA to be recombined between the two marker se- 
quences is preferably effected by genetic engineering methods. In a 

15 preferred embodiment of the invention the two marker sequences in 
the cloning vehicle are flanked by targeting sequences which are 
homologous to the BUD31-HCM1 locus on chromosome III of the S. 
cerevisiae genome. Therefore, the targeting sequences direct the 
integration of the excised fragments containing a recombination cas- 

20 settes into that locus. 

In the context of the present invention the terms "DNA sequences to 
be recombined" and "recombination substrate" mean any two DNA 
sequences that can be recombined as a result of meiotic recombina- 
tion processes, whereby recombination between these sequences 
25 can be due to homologous or non-homologous recombination. 

Homologous recombination events of several types are character- 
ized by the base pairing of a damaged DNA strand with a homolo- 
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gous partner, where the extent of interaction can involve hundreds of 
nearly perfectly matched base pairs. The term "homology" denotes 
the degree of identity existing between the sequence of two nucleic 
acid molecules. In contrast, illegitimate or non-homologous recombi- 
5 nation is characterized by the joining of ends of DNA that share no 
or only a few complementary base pairs. In yeast, non-homologous 
repair and recombination events occur at significantly lower frequen- 
cies than homologous recombination events. 

The first and second DNA sequences to be recombined are diverg- 
10 ing sequences, i.e. sequences, which are not identical but show a 
certain degree of homology. This means, that the DNA sequences to 
be recombined diverge by at least one nucleotide. Preferably the 
DNA sequences to be recombined are sequences that share at least 
one or more homologous regions, which can be very short The ho- 
15 mologous regions should comprise at least 5-10 nucleotides, pref- 
erably more than 20-30 nucleotides, more preferred more than 30-40 
nucleotides and most preferred more than 50 nucleotides. In a pre- 
ferred embodiment of the invention the first and the second DNA 
sequences to be recombined diverge by at least one nucleotide, in 
20 particular more than 0,1 %, preferably more than 5 % to more than 
50 %. This means, that the first and second DNA sequences to be 
recombined can also diverge by 55%, 60%, 65 % or even more. 

Recombination substrates or DNA sequences to be recombined can 
have a natural or synthetic origin. DNA sequences to be recombined 
25 therefore can be derived from any natural source including viruses, 
bacteria, fungi including S. cerevisiae, animals, plants and humans. 
In a preferred embodiment of the invention the first and the second 
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DNA sequences to be recombined are derived from organisms other 
than S. cerevisiae. 

In a preferred embodiment of the invention DNA sequences to be 
recombined are protein-encoding sequences, for example se- 
5 quences encoding enzymes, which can be utilized for the industrial 
production of naturaland non-natural compounds. Enzymes or those 
compounds produced by the help of enzymes can be used for the 
production of drugs, cosmetics, foodstuffs, etc. Protein-encoding se- 
quences can also be sequences, which encode proteins, that have 
10 therapeutic applications in the fields of human and animal health. 
Important classes of medically important proteins include cytokines 
and growth factors. The recombination of protein coding sequences 
allows for the generation of new mutated sequences which code for 
proteins with altered, preferably improved functions and/or newly 
1 5 acquired functions. In this way it is possible, for example, to achieve 
improvements in the thermostability of a protein, to change the sub- 
strate specificity of a protein, to improve its activity, to evolve new 
catalytic sites and/or to fuse domains from two different enzymes. 
Protein coding DNA sequences to be recombined can include se- 
20 quences from different species which code for the same or similar 
proteins that have in their natural context similar or identical func- 
tions. Protein coding DNA sequences to be recombined can include 
sequences from the same protein or enzyme family. Protein coding 
sequences to be recombined can also be sequences which code for 
25 proteins with different functions - for example, sequences that code 
for enzymes which catalyse different steps of a given metabolic 
pathway. In a preferred embodiment of the invention the first and the 
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second DNA sequences to be recombined are selected from the 
group of gene sequences of the Oxa superfamily of B-lactamases. 

In another preferred embodiment of the invention DNA sequences to 
be recombined are non-coding sequences such as sequences, 
5 which, for example, are involved within their natural cellular context 
in the regulation of the expression of a protein-coding sequence. 
Examples for non-coding sequences include but are not limited to 
promoter sequences, sequences containing ribosome binding sites, 
intron sequences, polyadenylation sequences etc. By recombining 
10 such non-coding sequences it is possible to evolve mutated se- 
quences, which in a cellular environment result in an altered regula- 
tion of a cellular process - for example, an altered expression of a 
gene. 

According to the invention a recombination substrate or DNA se- 
15 quence to be recombined can of course comprise more than one 
protein coding sequence and/or more than one non-coding se- 
quence. For example a recombination substrate can comprise one 
protein coding sequence plus one non-coding sequence or a combi- 
nation of different protein coding sequences and different non- 
20 coding sequences. In another embodiment of the invention DNA se- 
quences to be recombined therefore can consist of one or more 
stretches of coding sequences with intervening and/or flanking non- 
coding sequences. That means, the DNA sequence to be recom- 
bined can be for example a gene sequence with regulatory se- 
25 quences at its 5'-terminus. and/or an untranslated 3-region or an 
mammalian gene sequence with an exon/intron structure. In still an- 
other embodiment of the invention DNA sequences to be recom- 
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bined can consist of larger continuous stretches that contain more 
than a single coding sequence with intervening non-coding se- 
quences, such as those that as may belong to a biosynthetic path- 
way or an operon. DNA sequences to be recombined can be se- 
5 quences, which have already experienced one or more recombina- 
tion events, for example homologous and/or non-homologous re- 
combination events. 

The recombination substrates can comprise non-mutated wild-type 
DNA sequences and/or mutated DNA sequences. In a preferred em- 
10 bodiment therefore it is possible to recombine wild-type sequences 
with already existing mutated sequences in order to evolve new mu- 
tated sequences. 

In the context of the present invention the term "marker sequences" 
refers to unique DNA sequences that are positioned upstream or 

15 downstream of a recombination substrate or an already recombined 
DNA sequence in Saccharomyces cerevisiae cells. The presence of 
a marker sequence on the same molecule of DNA as the recombina- 
tion substrate or already recombined DNA sequence, preferably in 
combination with another marker sequence positioned on the other 

20 side of the recombination substrate, allows that recombination sub- 
strate or already recombined DNA sequence to be recognized and 
selected for, whether by molecular or genetic methods. Therefore, in 
one preferred embodiment of the invention there must be one or 
more marker sequences upstream of each recombination substrate 

25 and one or more marker sequences downstream of each recombina- 
tion substrate, such that in a cell heterozygous for two different re- 
combination substrates, there are at least four different marker se- 
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quences altogether. This arrangement allows for the selection of 
crossovers involving recombination substrates. It also allows further 
rounds of recombination to be carried out in a iterative fashion. In 
another preferred embodiment of the invention more than one 
5 marker can be situated on each side of the recombination substrate. 
For example, additional markers can be introduced to increase the 
stringency of selection. 

Marker sequences may comprise protein-encoding . or non-coding 
DNA sequences. In a preferred embodiment of the invention the pro- 

10 tein-encoding marker sequences are selected from the group con- 
sisting of nutritional markers, pigment markers, antibiotic resistance 
markers, antibiotic sensitivity markers and sequences that encode 
different subunits of an enzyme, which functions only, if both or more 
subunits are expressed in the same cell. In a further preferred em- 

15 bodiment of the invention the molecular non-coding marker se- 
quences include but are not limited to primer recognition sites, i.e. 
sequences to which PGR primers anneal and which allow an amplifi- 
cation of recombinants, intron/exon boundaries, promoter se- 
quences, downstream regulated gene sequences or restriction en- 

20 zyme sites. 

A "nutritional marker" is a marker sequence that encodes a gene 
product that can compensate an auxotrophy of an organism or cell 
and thus can confer prototrophy on that auxotrophic organism or 
cell. In the context of the present invention the term "auxotrophy" 
25 means that an organism or cell must be grown in a medium contain- 
ing an essential nutrient which cannot be synthesized by the 
auxotrophic organism itself. The gene product of the nutritional 
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marker gene promotes the synthesis of this essential nutrient miss- 
ing in the auxotrophic cell Therefore, upon expression of the nutri- 
tional marker gene it is not necessary to add this essential nutrient to 
the medium in which the organism or cell is grown, since the organ- 
5 ism or cell has acquired prototrophy. 

A "pigment marker" is a marker gene wherein the gene product is 
involved in the synthesis of a pigment which upon expression will 
stain that cell, in which the pigment marker is expressed. A cell with- 
out the pigment marker does not synthesize the pigment and is 
10 therefore not stained. The pigment marker therefore allows a rapid 
phenotypical detection of that cell containing the pigment marker. 

An "antibiotic resistance marker* is a marker gene wherein the gene 
product confers upon expression to a cell, in which the expression of 
the antibiotic marker gene takes place, the ability to grow in the 
15 presence of a given antibiotic at a given concentration, whereas a 
cell without the antibiotic resistance marker cannot. 

An "antibiotic sensitivity marker" is a marker gene wherein the gene 
product destroys upon expression the ability of a cell to grow in the 
presence of a given antibiotic at a given concentration. 

20 In a preferred embodiment of the invention each of the gene prod- 
ucts of the first and third marker sequences can compensate an 
auxotrophy of a S. cerevisiae cell. Preferably, the first marker se- 
quence is URA3, the gene product of which can confer uracil proto- 
trophy to a uracil auxotrophic S. cerevisiae cell. Preferably, the third 

25 marker sequence is TRP1, the gene product of which can confer 
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tryptophan prototrophy to an tryptophan auxotrophic S. cerevisiae 
cell. 

In another preferred embodiment of the invention the gene products 
of the second and fourth marker sequences confer sensitivity to an 
5 antibiotic to a S. cerevisiae cell which is resistant to that antibiotic. 
Preferably, the second marker sequence is CAN1, the gene product 
of which can confer to a canavanine resistant S. cerevisiae cell sen- 
sitivity to canavanine. Preferably, the fourth marker sequence is 
CYH2, the gene product of which can confer to a cycloheximide re- 
1 0 sistant S. cerevisiae cell sensitivity to cycloheximide. 

In another preferred embodiment of the invention the marker se- 
quences comprise annealing sites for PCR primers. Preferably, the 
first, second, third and fourth marker sequences are recognized by 
the primers KNS11, KNS28, KNS16, and KNS29. 

15 In a preferred embodiment of the inventive process haploid cells 
containing recombination cassettes with either first, second, third or 
fourth recombined DNA sequences can be identified by PCR proc- 
esses in order to detect the presence of the respective marker com- 
bination. 

20 In another preferred embodiment of the inventive process haploid 
cells containing recombination cassettes with either first, second, 
third or fourth recombined DNA sequences are identified by plating 
the haploid cells on media that select for the presence on the same 
DNA molecule of the respective marker combination. This means 

25 that haploid cells containing first recombined DNA sequences are 
plated on a medium that selects for the presence of the first and the 
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fourth marker sequences. Haploid cells containing second recom- 
bined DNA sequences are plated on a medium that selects for the 
presence of the second and the third marker sequences. Haploid 
cells containing third recombined DNA sequences are plated on a 
5 medium that selects for the presence of the first and the second 
marker sequences. Haploid cells containing fourth recombined DNA 
sequences are plated on a medium that selects for the presence of 
the third and the fourth marker sequences. 

Another aspect of the present invention relates to a process of gen- 
10 erating novel proteins, enzymes, pathways and non-coding se- 
quences with novel or improved functions and properties, whereby 
known protein-coding sequences or known non-coding sequences 
are subjected one or more recombination rounds by using the inven- 
tive process for generating and detecting recombinant DNA se- 
15 quences in S. cerevisiae. 

Another aspect of the present invention relates to plasmid pMXY9. 
Plasmid pMXY9 comprises the URA3 marker gene and the CAN1 
marker gene, which are located .adjacently. Between the two marker 
gene a polylinker sequence, comprising several restriction sites for 

20 inserting a DNA sequence to be recombined, is arranged. The two 
markers are flanked by targeting sequences homologous to the 
BUD31-HCM1 locus on chromosome III of the S. cerevisiae genome. 
The polylinker sequence between the two marker genes comprises 
restriction sites for the restriction enzymes Sma\, Xba\, BgS\\ and 

25 Pad. 

Another aspect of the present invention relates to plasmid pMXY12. 
Plasmid pMXY12 comprises the TRP1 marker gene and the CYH2 
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marker gene. Between the two marker genes a polylinker sequence 
comprising several restriction sites for inserting a DNA sequence to 
be recombined is arranged. The two markers are flanked by target- 
ing sequences homologous to the BUD31-HCM1 locus on chromo- 
5 some III of the S. cerevisiae genome. The polylinker sequence com- 
prises restriction sites for the restriction enzymes Spel, Sma\ and 
Pad * 

The present invention relates also to the S. cerevisiae strain MXY47, 
characterized in that diploid cells thereof are homozygous for the 
1 0 alleles i/ra3-7, trp1-1, can1-100 and cyh2R and heterozygous for the 
rnsh2::KanMX mutation. 

The present invention also relates to the E. coli strain JM101, con- 
taining plasmid pMXY9, and to E. coli strainDH5a, containing plas- 
mid pMXY1 2. 

1 5 Another aspect of the present invention relates to a kit which can be 
used for conducting the inventive process for generating and detect- 
ing recombined DNA sequences in Saccharomyces cerevisiae. In a 
first embodiment the kit comprises at least a first container which 
contains cells of S. cerevisiae strain MXY47, a second container 

20 which contains cells of E. coli strain JM101 bearing plasmid pMXY9 
and a third container containing cells of E. coli strain DH5<x bearing 
plasmid pMXY12. 

In a second embodiment the kit comprises at least a first container 
containing cells of S. cerevisiae strain MXY47, a second container 
25 containing DNA of plasmid pMXY9 and a third container containing 
DNA of plasmid pMXY12. 
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The present invention is illustrated by the following sequence listing, 
figures and example. 

Figure 1 shows a schematic of the selection system for the selection 
of recombinants on defined media. Diploid parental cells heterozy- 

5 gous for recombination cassettes - here, recombination substrate A, 
flanked by the URA3 and CAN1 genes, and recombination substrate 
B, flanked by the TRP1 and CYH2 genes - are induced to undergo 
meiosis. Spores are plated on medium lacking Uracil and containing 
canavanine (-Ura+Can) and on medium lacking tryptophan and con- 

10 taining cycloheximide (-Trp+Cyh) to select for recombinant cells 3 
and 4, in which a crossover involving the recombination substrates A 
and B has taken place, as indicated by (+). Parental diploids and 
non-recombinant haploids 1 and 2 cannot grow on either of these 
media, as indicated by (-). A subsequent round of meiosis may use 

15 recombinants 3 and 4 to construct a new diploid, which when 
sporulated yields new recombinant cells bearing the same flanking 
marker configurations as those shown in cells 1 and 2. Recombinant 
spore colonies with these configurations can be selected on medium 
lacking uracil and containing cycloheximide (-Ura+Cyh), and on me- 

20 dium lacking tryptophan and containing canavanine(-Trp+Can), re- 
spectively. 

Figure 2 shows the plasmids pMXY9 and pMXY12 (above), which 
are vectors used for the targeting of recombination cassettes to the 
BUD31-HCM1 locus on chromosome III of the yeast genome. Both 
25 plasmids bear sequences homologous to this locus (indicated as 5' 
and 3'), which flank the URA3 and CAN1 markers (pMXY9) or TRP1 
and CYH2 (pMXY1 2) markers. A short sequence bearing restriction 
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sites that allow for cloning recombination substrates is located be- 
tween each pair of marker sequences. Below, integration of recom- 
bination cassettes Into the BUD31-HCM1 locus. A pMXY9 derivative 
bearing recombination substrate A is digested with Nott to liberate 
5 the recombination cassette flanked by 5 f and 3' targeting sequences 
and digestion products are transformed into MXY47 cells. Ur&+ de- 
rivatives that contain a correctly targeted insert are identified for 
subsequent use in constructing strains heterozygous for recombina- 
tion cassettes. Recombination cassettes bearing the TRP1 and 
10 CYH2 markers are similarly constructed in pMXY12 and trans- 
formed into MXY47, followed by selection for tryptophan prototrophy. 

Figure 3 shows the frequency of recombination between Oxa genes 
as a function of sequence identity In wild type and m$h2 strains. 
15 Above, the mean ± standard deviation of (n) independent expert- 
ments is provided. Below, graphical representation of these data. 
The following strains were used: MXY60, MXY62, MXY64, MXY66, 
MXY99,andMXY102. 

Figure 4 shows the msh2 hyper-recombination effect. A ms/72/wt ra- 
20 tio was calculated for each independent experiment (total number = 
n) for pairs of strains with the given percent of shared Oxa homology 
and for each selection condition, and the the mean ± standard devia- 
tion of these summed values are shown. The data are represented 
graphically below. The following pairs of strains were used: MXY60 
25 and MXY62, MXY64 and MXY66, MXY99 and MXY102. 

Figure 5 shows a PCR analysis of recombination between Oxa se- 
quences sharing 78% homology. Spore colonies were derived from 
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wild type (MXY99) and msh2 (MXY102) diploids by selection on me- 
dium lacking uracil and containing canavanine, or on medium lacking 
tryptophan and containing cycloheximide. Colony PCR was per- 
formed on selected spore colonies that exhibited phenotypes consis- 

5 tent with those expected for crossover recombinants. Above, two 
reactions were carried out for each wild type and msh2 Ura+CanR 
candidate, one with a parental-specific primer pair (KNS16 + KNS28, 
products shown in the first of each pair of lanes for each candidate), 
and the other with a recombinant-specific primer pair (KNS16 + 

10 KNS29, second lane). Below, similar reactions were carried out for 
each wild type and msh2 Trp+CyhR candidate, one with a parental- 
specific primer pair (KNS11 + KNS29, first lane), and the other with a 
recombinant-specific primer pair (KNS11 + KNS28, second lane). 
Control reactions were carried out on appropriate genomic DNA 

15 templates containing known configurations of flanking marker se- 
quences, either parental (P) or recombinant (R). (-) no DNA control. 

Figure 6 shows the frequencies of recombination for second-round 
recombination. Wild type and msh2 haploids obtained after a first 
round of recombination with MXY64 and MXY66 were mated to pro- 

20 duce wild type (MXY81, MXY82 and MXY83) and msh2 (MXY86, 
MXY87, and MXY88) diploids with mosaic Oxa7-Oxa11 recombina- 
tion cassettes. Wild type (MXY90) and msh2 (MXY92) diploids ho- 
mozygous for the Oxa11 recombination substrate were also con- 
structed from recombinant progeny of MXY60 and MXY62. All dip- 

25 loids were sporulated and spores were plated on media to select for 
Ura+CanR and Trp+CyhR recombinants. 
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The sequence listing comprises the following sequences: 

SEQ ID No. 1 and 2 show the sequences of the primers MSH2UP 
and MSH2DN, respectively, for the amplification of MSH2. 

SEQ ID No. 3 to SEQ ID No. 6 show the sequences of the primers 
5 MSH2A1, MSH2A2, MSH2A3 and MSH2A4, respectively, which are 
MSH2-specific analytical primers. 

SEQ ID No. 7 and SEQ ID No. 8 show the sequences of the primers 
K2KANMX and K3KANMX, respectively, which are KanMK-specific 
analytical primers. 

10 SEQ ID No. 9 and SEQ ID No. 10 show the sequences of the prim- 
ers LEU2UP and LEU2DN, respectively, which are used for the am- 
plification of LEU2. 

SEQ ID No. 1 1 and SEQ ID No. 12 show the sequences of the prim- 
ers HIS3UP and HIS3DN, respectively, which are used for the ampli- 
15 ficationof HIS3. 

SEQ ID No. 13 and SEQ ID No. 14 show the sequences of the prim- 
ers KNS1 and KNS2, respectively, which are used for the amplifica- 
tion of the 3' targeting sequence. 

SEQ ID No. 15 to SEQ ID No. 17 show the sequences of the primers 
20 KNS3, KNS4 and KNS6, respectively, which are used for the amplifi- 
cation of a 5' targeting sequence. 
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SEQ ID No. 18 and SEQ ID No. 19 show the sequences of the prim- 
ers KNS7 and KNS8, respectively, which are used for the amplifica- 
tion of Oxa7. 

SEQ ID No. 20 and SEQ ID No. 21 show the sequences of the prim- 
5 ers KNS9 and KNS10, respectively, which are used for the amplifica- 
tion of Oxa11. 

SEQ ID No. 22 shows the sequence of the primer KNS1 2, which is a 
BUD31 downstream analytical primer. 

SEQ ID No. 23 shows the sequence of the primer KNS13, which is a 
10 BUD31 upstream analytical primer. 

SEQ ID No. 24 shows the sequence of the primer KNS14, which is a 
TRPf-specific analytical primer. 

SEQ ID No. 25 shows the sequence of the primer KNS15, which is a 
C/R>A3-specific analytical primer. 

15 SEQ ID No. 26 and SEQ ID No. 27 show the sequences of the prim- 
ers KNS17 and KNS18, respectively, which are used for the amplifi- 
cation of CYH2. 

SEQ ID No. 28 shows the sequence of the primer KNS30, which is a 
7RPf- specific forward primer used as sequencing primer. 

20 SEQ ID No. 29 shows the sequence of the primer KNS31 , which is a 
CAN1 -specific reverse primer used as sequencing primer. 
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SEQ ID No. 30 shows the sequence of the primer KNS33, which is a 
CYH2-specific reverse primer used as sequencing primer. 

SEQ ID No. 31 and SEQ ID No. 32 show the sequences of the prim- 
ers KNS36 and KNS37, respectively, which are used for the amplifi- 
5 cation of 0xa5. 

SEQ ID No. 33 shows the sequence of the primer KNS38, which is a 
(7RA3-specific forward primer used as sequencing primer. 

Example - Generation of mosaic genes in Saccharomvces cere- 
visiae mismatch repair mutants 

10 1. Materials and methods 

1.1 Media 

Standard rich medium YPD (Bio101) was used for routine growth, 
and synthetic dropout media (Bio101) were used to monitor genetic 
markers and for selection of recombinants. For sporulation, cells we- 

15 re precultured overnight in SPS (50 mM potassium phthalate, pH 
5.0, 0.5% yeast extract (Difco), 1% Bactopeptone (Difco), 0.17% 
yeast nitrogen base, 1% potassium acetate, 0.5% ammonium sulfa- 
te) plus required nutritional supplements, washed, resuspended in 
1% potassium acetate plus supplements and incubated with shaking 

20 for two days. AH manipulations were carried out at 30°C. For tetrad 
analysis, asci were digested with Helix pomatia B-glucuronidase 
(Sigma) and dissected using a Nikon Eclipse E400 microscope fitted 
with a TDM400 micromanipulator (Micro Video Instruments, Inc.). 
Other genetic methods were conducted as described by Ausubel et 
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al. Current Protocols in Molecular Biology (1998), John Wiley and 
Sons, Inc., New York. All yeast transformations were performed u- 
sing the LiAc method according to Agatep et al., Technical Tips On- 
line (http://tto.trends.com). 

5 1.2. Yeast strains 

All yeast strains used or created in this study are listed in Table 1 
and Table 2. All yeast strains are isogenic derivatives of the readily 
sporulating W303 background. The diploid MXY47, which serves as 
a host for transformation with recombination cassettes, was con- 
10 structed by transformation and genetic crosses as follows. The hap- 
loid D184-1B (a gift of S. Gangloff, CEA, France) was transformed 
with a LEU2 fragment (obtained by preparatory PCR of the W303 
strain U474 with the primer pair LEU2UP/LEU2DN which are listed in 
the sequence listing to yield the Leu+ haploid MXY13. The haploid 
15 D184-1C (a gift of S. Gangloff) was transformed with a HIS3 frag- 
ment (obtained by preparatory PCR of ORD4369-25D with the 
primer pair HIS3UP/HIS3DN) to yield the His+ haploid MXY25. The 
haploids MXY18 and MXY22 are recessive cycloheximide-resistant 
{cyh2R) derivatives of D184-1B and D184-1C, respectively, selected 
20 on 10 ng/ml cycloheximide; the presence of mutations mapping to 
the CYH2 locus that confer cycloheximide resistance was confirmed 
by sequencing (two different nucleotide alterations resulting in a 
change of glutamine 38 to lysine) and segregation analysis. MXY18 
and MXY25 were crossed to obtain the diploid MXY29; MXY13 and 
25 MXY22 were crossed to obtain the diploid MXY33. The haploid seg- 
regants MXY29-6D and MXY33-8C were crossed to obtain MXY38, 
which is heterozygous for the Ieu2-3,112 and his3-11,15 markers 
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and homozygous for the cyh2R mutation. MXY38 was transformed 
with the msh2::KanMX cassette amplified by PGR from RBT348 (a 
gift of R. Borts, University of Leicester) with the primers MSH2UP 
and MSH2DN to yield MXY47. Transformants were selected on 200 
5 yg/ml G418 (Invitrogen) and confirmed by colony PCR (see below) 
with the primers MSH2A1, MSH2A2, MSH2A3 and MSH2A4 and by 
tetrad analysis, i.e. analysis of the four spores, to confirm marker 
segregation. 

1 0 Table 1 . Haploid yeast strains 



Name 


Genotype 


Source or derivation 


D184-1B 


a ura3-1 trp1-1 can1-1Q0his3-11, 15 Ieu2- 
3,112 ade2-1 


S. Gangloff 


D184-1C 


alpha ura3~1 trp1-1 can1-100 hls3-11,15 
Ieu2-3,112 ad&2-1 


S. Gangloff 


U474 


alpha ura3-1 trp1-1 can1-100 his3-11,15 
ade2-1 


S. Gangloff 


ORD4369- 
25D 






MXY13 


a um3~1 trp1-1 caM-100 h!s3-11,15 
ade2-1 


D1S4-1B transformed 
with LEU2 PCR prod- 
uct 


MXY18 


a ura3 -1 trpM can1-100 cyh2R hi$3- 
11,15 Ieu2-3,1 12 ade2-1 


D184-1B cyhR deriva- 
tive 


MXY22 


alpha ura3-1 trp1-1 can1-100 cyh2R hi$3~ 
11,15 Ieu2-3,112ade2~1 


D184-1C cyhR deriva- 
tive 


MXY25 


alpha ura3~1 trp1-1 can1-100 Ieu2~3,112 
ad*2-1 


D184-1C transformed 
with HIS3 PCR prod- 
uct 


MXY29- 
6D 


alpha ura3-1 trp1-1 can1-100 cyh2R Ieu2- 
3,112 ade2-1 


MXY29 segregant 



37 



MXY33- 
8C 


a ura3-1 trp1-1 can1-100 cyh2R hls3- 
11,15 ade2-1 


MXY33 segregant 


MXY50- 
3D 


alpha msh2::KanMX ura3-1 trp1-1 cyh2R 
oan1-100 Ieu2-3,112 ade2-1 
BUD31-URA3-CAN1 


MXY50 segregant 


MXY50- 
7D 


alpha ura3-1 trp1-1 cyh2R canl-100 hls3- 
11,15 ade2-1 BUD31::URA3-CAN1 


MXY50 segregant 


MXY51- 
2B 


alpha mshZ:KanMX ura3-1 trp1-1 cyh2R 
can1-100 his3-11,15 ade2-1 
BUD31::URA3-Oxa7-CAN1 


MXY51 segregant 


MXY51- 
10C 


alpha ura3-1 trp1-1 cyh2R can1-100 Ieu2- 
3,112 ade2-1 BUD31::URA3-Oxa7-CAN1 


MXY51 segregant 


MXY52- 
2A 


alpha mshZ:KanMX ura3-1 trp1-1 cyh2R 
can1-100 Ns3-11,15 ade2-1 
BUD31::URA3-0xa11-CAN1 


WIXY52 segregant 


MXY52- 
7D 


alpha ura3-1 trp1-1 cyh2R can1-100 teu2- 
3,112 ade2*1 BUD31::URA3-Oxa11- 
CAN1 


MXY52 segregant 


MXY53- 
11C 


a ura3-1 trp1-1 cyh2R can1-100 Ieu2~ 
3,112 ade2-1 BUD31::TRP1-CYH2 


MXY53 segregant 


MXY53- 
11D 


a msh2::KanMX ura3-1 trp1-1 can1-100 
cyh2R his3-11,1S ade2-1 BUD31::TRP1- 
CYH2 


MXY53 segregant 


MXY55- 
1C 


a msh2::KanMX ura3-1 trp1-1 can1-100 
cyh2R his3-11,15 ade2-1 BUD31::TRP1- 
Oxa11-CYH2 


MXY55 segregant 


MXY55- 
2B 


a ura3-1 trp1-1 canl-100 cyh2R his3- 
11,15 ade2-1 BUD31::TRP1-0xa11- 
CYH2 


MXY55 segregant 


MXY55- 
13D 


a m$h2::KanMX ura3-1 trp1-1 can1-100 
cyh2R Ieu2-3,112 ade2-1 BUD31::TRP1- 
Oxa11-CYH2 


MXY55 segregant 
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MXY79- 
3B 


alpha ura3-1 trp1~1 can1-100 cyh2R his3- 
11,15 leu2-3,112 ade2-1 BUD31::URA3- 
Oxa5-CAN1 


MXY79 segregant 


MXY79- 
9A 


alpha msh2::Kan MX ura3-1 trp1-1 canl- 
100 cyh2R leu 2-3, 1 12 ade2-1 
BUD31::URA3-Oxa5-CAN1 


MXY79 segregant 


RBT348 


alpha msh2::KanMX ura3 cyhR met13-4 
Iys2-d 


R. Borts 


Table 2. Diploid yeast strains. 


Name 


Genotype 


Source or derivation 


MXY29 


a/alpha ura3-1f trp1-1f cyh2R/CYH2 
can1~100/" hi$3-11,15MlS3 Ieu2-3,112f' 
ade2-1/" 


MXY18XMXY25 


MXY33 


a/alpha un*3-1/" trp1-1f cyh2R/CYH2 
can1~100/" h\s3-11,1&" teu2-3,112A.EU2 
ade2-1/" 


MXY22xMXY13 


MXY38 


a/alpha ura3-ir trp1-1/" cyh2RT can1~ 
100/' his3-11,15/HIS3 Ieu2-3,112/LEU2 
ade2-ir 


MXY33-8C x MXY29-6D 


MXY47 


a/alpha msh2::KanMX/MSH2 ura3-1/" 
trpl-1/" cyh2RT can1-100/" Ws3- 
11, 15/HIS3 !eu2-3, 112AEU2 ade2-1/ n 


MXY38 transformed 
with msh2::KanMX 
PGR product 


MXY50 


a/alpha msh2:KanMX/MSH2 ura3-1/" 
trp1-1/» cyh2Rf' can1-10QT /ms3- 
11,15/HIS3 leu2-3,112A-EU2 ade2-1f 
BUD31::URA3-CAN1/BUD31 


MXY47 transformed 
with Afofl -digested 
PMXY9 


MXY51 


a/alpha msh2::KanMX/MSH2 ura3-1/" 
trp1-1f' cyh2RT can1-10QT /i/s3- 
11,1S/HIS3 !eu2-3 t 112/LEU2 ade2-1f 
BUD31::URA3-Oxa7-CAN1/BUD31 


MXY47 transformed 
with A/ofl-digested 
pMXY13 


MXY52 


a/alpha msh2::KanMX/MSH2 u/a3T trpl- 
ir cyhlRT oanl-IQOr his3-11,1S/HIS3 
Ieu2-3,112/LEU2 ade2-1/" BUD31::URA3- 
Oxa11-CAN1/BUD31 


MXY47 transformed 
with A/ofl-digested 
PMXY14 


MXY53 


a/alpha msh2::KanMX/MSH2 wa3~1F 
trp1-1J" cyh2R/ tt can1-10QT his3- 
11,1&HIS3 leu2-3,112jrLEU2 ade2-1J" 
BUD31:: TRP1-CYH2mUD31 


MXY47 transformed 
with Wort -digested 
PMXY12 


MXY55 


a/alpha msh2::KanMXr MSH2 ura$-ir 
trp1-1/" cyh2Rf'can1-100/" his3- 
11, 15/HIS3 teu2-3,112AJEU2 ade2-1/" 
BUD31::TRP1-Oxa11-CYH2/BUD31 


MXY47 transformed 
with Not 1 -digested 
pMXY22 
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MXY57 


a/alpha msh2::KanMXT ura3-ir trp1-ir 
cyh2RT caM-100/" h!s3-11,1S/HIS3 Ieu2- 
3.112AJEU2 ade2-1/" BUD31::TRP1- 
CYn2/BUDoi..Uri/\&^f\wi 


MXY50-3D x MXY53- 
11D 


MXY59 


a/alpha ura3~1f trp1-1/" cyh2RT canl- 
100f his3-11,15MIS3 Ieu2-3,112A£U2 
ade2-1/" BUD31::TRP1-CYH2/ 
BUD31::URA3-CAN1 


MXY50-7D x MXY53- 
11C 


MXY60 


a/alpha ura3-1/" trp1-1/" cyh2RT can1~ 
100r hi$3-11,15M!S3 \eu2~3,112A-EU2 
ade2-1f . BUD31::TRP1-0xa11- 
CYH2/BUD3 1::URA3-Oxa 1 7-C/UV7 


MXY52-7D x MXY55- 
2B 


MXY62 


a/alpha msh2::KanMXr urao-ir irpi-i/ 
cyh2RT can1~100/" Ns3-11,15/HlS3 Ieu2- 
3,112A.EU2 ade2-1f' BUD31::TRP1- 
Oxa11-CYH2/BUD31::URA3-Oxa11>. 
CAN1 


R/IVY*;9-5>A x MXYS5- 
1C 


MXY64 


a/alpha ura3-if trpi-u cynzrv cani- 
7 W/ nis3-1 1, 7 o/n/oo leutrj, 1 1 cjuz%j& 
ade2-1f BUD31::URA3-Oxa7- 
CAN1/BUD3 1::TRP1-Oxa11~CYH2 


MXY51-1DC x MXY55- 
2B 

few 


MXY66 


a/alpha msh2::KanMX/" ura3-1/" trpW 
cyh2Rr cani-iUQf n/so-77,7ivn/oo i&u£- 
3.112A.EU2 ade2-1/" BUD31::TRP1- 
Oxa11-CYH2mUD31::URA3-Oxa7-CAN1 


MXY51-2B X MXY55- 
13D 








WIYY7Q 


a/alpha mshZ':KanMX/MSH2 ura3-1/" 
trp1-1/" cyh2R/" can1~100/" his3- 
11,15/HIS3 Ieu2-3,112/LEU2 ade2-1f' 
BUD31::URA3-Oxa5~CAN1/BUD31 


MXY47 transformed 
with Afofl -digested 
pMXY24 














MXY99 


a/alpha ura3~1f trp1-1/" cyh2RT car>1- 
100/* his3-1 1, 1ST !eu2-3, 1 12/LEU2 ade2- 
ir BUD31::URA3-Oxa5- 
CAN1/BUD31::TRP1-0xa1 1-CAN1 


MXY79-3B x MXY55- 
2B 


MXY102 


a/alpha msh2::KanMXT ura3-V" trp1-1f 
cyh2R/ n can1-100/can1 hi$3-11,15/HIS3 
leu2-3, 1 12/LEU2 ade2-1/» BUD31::URA3- 
Oxa5~CAN1/BUD31::TRP1-Oxa11-CAN1 


MXY79-9A x MXY55- 
1C 



1.3 Plasmid construction 

The bacterial strains XL1-Blue MRF {AmcrA)183(mcrCB-hsdSMR~ 
mrr)173 endA1 supE44 thi-1 recA1 gyrA96 relA1 lac IF proAB 
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lacl"ZAM15 Tn10 (Tef)]) and JM110 {rpsL [Stf] thr leu thi-1 lacY 
galK galT ara tonA tsx dam dcm supE44 A[lac-proAB] [F traD36 
proAB lacPZLM15J) were used as hosts for cloning. Standard meth- 
ods were used for plasmid construction (Ausubel et al.). All plasmids 
5 used or created in this study are listed in Table 3. Restriction en- 
zymes, T4 DNA ligase and other enzymes used in cloning were pur- 
chased from New England BioLabs. DNA fragments and plasmids 
were purified using kits supplied by Qiagen and Macherey-Nagel. 

Upstream ("5' target") sequences corresponding to the BUD31 locus 

10 were amplified by preparatory PCR from W303 genomic DNA with 
the primer pair KNS3/KNS4 and cloned as a Kpn\IXho\ fragment 
into KpnlfXhol -digested pKSII(+) (Stratagene) to create pMXY1; 
downstream ("3' target") targeting sequences were similarly ampli- 
fied with the primer pair KNS1/KNS2 and cloned as a XbaMNott 

15 fragment into XbaVNott -digested pKSII(+) (Stratagene) to create 
PMXY2. The TRP1 marker was excised from pJH53 (a gift of R. 
Borts) as a Bgr/ll/EcoRI fragment and ligated to BamHI/EcoRI- 
digested pMXY1 to create pMXY3, and the URA3 marker was ex- 
cised from Xnol/H/nDIII-digested pRED316 (a gift of R. Borts) and 

20 ligated to Xnol/H/nDIII-digested pMXY1 to create pMXY4. The CAN1 
marker was isolated from pRED316 as a Sma1 fragment and ligated 
to Hpa1 -digested pMXY2 to create pMXY5. The 5' targeting se- 
quences in pMXY3 and pMXY4 were replaced with sequences re- 
amplified from genomic DNA with the primer pair KNS4/KNS6 and 

25 ligated as Kprr\fXho'\ fragments into the respective Kpn1/Xno1- 
digested plasmids to produce pMXY7 and pMXY6. This step was 
undertaken to correct the absence from the primer KNS3 of restric- 
tion sites required in later phases of cloning. The Kpn\-Sma\ frag- 
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ment of pMXY6 containing the 5' target and the URA3 marker were 
ligated to Kpnl/Smal-digested pMXY5 to produce the URA3-CAN1 
recombination cassette vector pMXY9. The Kpn\ISpe\ fragment of 
pMXY7 containing the 5* target and TRP1 marker were ligated to 

5 Kpnl/Spel-digested pMXY2 to produce pMXY11. Finally, the CYH2 
marker was amplified by preparatory PCR from W303 genomic DNA 
with the primer pair KNS17/KNS18, digested with BamHl and Pvu\, 
and ligated to Bg/ll/Pad-digested pMXY11 to create the TRP1-CYH2 
recombination cassette vector pMXY12. All plasmid constructs were 

10 introduced into bacterial hosts by electroporation and verified by re- 
striction analysis, and pMXY9 and pMXY12 were further verified by 
sequencing of all cloning junctions. N 

B-lactamase recombination substrates were amplified by preparatory 
PCR from host plasmids (provided by W. Schoenfeld) using the 

15 primer pairs KNS36/KNS37 for Oxa5 (accession X58272), 
KNS7/KNS8 for Oxa7 (accession X75562), and KNS9/KNS10 for 
Oxa11 (accession Z22590). Oxa7 and Oxa11 PCR products were 
digested with Pad and ligated to Smal/Pad-digested pMXY9 to cre- 
ate pMXY13 and pMXY14, respectively, and the Oxa11 PCR product 

20 was also digested with Spel and Pad and ligated to SpeVPad- 
digested pMXY12 to create pMXY22. The Oxa5 PCR products were 
digested with BamHI and Pad and ligated to Bg/ll/Pad-digested 
pMXY9 to create pMXY24. All constructs were verified by restriction 
analysis. 
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Table 3. Plasmids 



Name 


uescnption or insert 


Source 


ptvoll (+y 


Parental vector 


Stratagene 


PKCUO JO 


u/\/io and u/uvT source 


R. oorts 




TnHi source 


R. Boris 


pMXY1 


5' target 


This work 


pMXY2 


3 target 


This work 


pMXY3 


5' target- TKPf 


This work 


pMXY4 


5 1 target-l/RA3 


This work 


pMXYS 


CyWf-3' target 


This work 


pMXY6 


5* target-UR/13 


This work 


pMXY7 


5' target- TRP1 


This work 


PMXY9 


5' target-UR/l3-C/lA/f-3' target 


This work . 


pMXY11 


5' target- TRP1-y target 


This work 


PMXY12 


5' target- TRP1-CYH2-y target 


This work 


PMXY13 


5' target-afM^xaT-CiWW 
target 


This work 


pMXY14 


5' target-U/?430xa1 1-CAW-3' 
target 


This work 


pMXY22 


5' target- 7KPf-Oxa1 1-CYtf2-3' 
target 


This work 








PMXY24 


5' target-^y/W3-Oxa6.CA/Vf-3 , 
target 


This work 



14 Recombinant selection and characterization 



For first round recombination, plasmids bearing recombination cas- 
settes were digested with Afofl and total digestion products were 
used to transform MXY47. Uracil (for pMXY9 derivatives) or trypto- 
phan (for pMXY12 derivatives) prototrophs were selected, and tar- 
geting of one of the two chromosomal copies of the BUD31-HCM1 
locus by the introduced construct was confirmed by colony PCR us- 
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ing the primers KNS12/KNS1 3/KNS1 5 for URA3-CAN1 derivatives 
and the primers KNS12/KNS13/KNS14 for TRP1-CYH2 derivatives, 
which allow fragments from the intact and from the disrupted BUD31- 
HCM1 loci to be amplified. Transformed heterozygotes were 

5 sporulated and tetrad analysis was carried out to identify wild type or 
msh2 haploids bearing recombination cassettes. Appropriate hap- 
loids of opposite mating type were patched on YPD plates, allowed 
to grow overnight, mixed together on the same YPD plate and al- 
lowed to mate overnight. The mating plate was replica plated to - 

10 Ura-Trp medium to select for diploids, which were inoculated the fol- 
lowing day in bulk into SPS plus supplements and cultured over- 
night. The preculture was spun down and washed, and the cells 
were resuspended in 1% K acetate plus supplements and incubated 
for two days. 

15 Sporulated cells were harvested, quantified, and in some cases dis- 
sected to confirm appropriate segregation of all markers. Asci were 
digested with zymolyase-20T (ICN Biomedicals) to liberate spores, 
the spore suspension was sonicated (Branson Model 250 Digital 
Sonifier), and appropriate dilutions were plated on YPD to determine 

20 cell viability, on uracil dropout media containing 60 ug/ml ca- 
navanine (Sigma) to select Ura+CanR recombinants, and on trypto- 
phan dropout media containing 3 ug/ml cycloheximide (Sigma) to 
select Trp+CyhR recombinants. Spore colonies arising on each me- 
dium were counted and subjected to phenotypic and molecular tests 

25 to determine whether they represented true recombinants. For 
phenotypic analysis, a representative number of candidate 
recombinants was.restreaked to the same medium used for selection 
and then replica plated to -Una, -Trp, cycloheximide (10 ug/ml), 
canavanine (60 ug/ml), and mating type tester plates. 
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Spores were also plated on - 
Ura-Trp media to determine the frequency of diploids for each spore 
preparation, which in all cases was lower than 4% of total viable 
cells. For molecular analysis, total genomic DNA was subjected to 
5 analytical PCR (see below) using appropriate primer pairs that spe- 
cifically amplify parental or recombinant fragments. The frequencies 
of recombination for a given selection are expressed as the fre- 
quency of viable cells on a given selection medium, corrected for the 
presence of non-recombinants exhibiting a false positive phenotype. 
10 In most cases, such false positives arose by mutational inactivation 
of the CAN1 or CYH2 marker, as suggested by analytical PCR. 

For second round recombination, appropriate recombinants derived 
from the first round of recombination were mated and Ura+Trp+ dip- 
loids were selected. The same sporulation procedure as for first 

15 round recombination was followed, except that spores were plated 
on YPD, on uracil dropout media containing cycloheximide to select 
Ura+CyhR recombinants, and on tryptophan dropout media contain- 
ing canavanine to select Trp+CanR recombinants. Candidate re- 
combinants were similarly subjected to phenotypic and molecular 

20 analysis. 

1.5 Molecular methods 

Genomic DNA used as a template for preparatory or analytical PCR 
was prepared from overnight YPD cultures by a standard miniprep 
procedure according to Ausubel et al. Preparatory PCR of fragments 
25 used in cloning or for sequencing was performed with Oxa plasmid 
DNA (approximately 50 pg) or yeast genomic DNA (approximately 
0.5 ng) as a template in 50 p\ reactions containing 2.5 U Herculase 
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polymerase (Stratagene), 1x Herculase reaction buffer, 0.2 mM each 
dNTP and 100 ng each primer. Amplification was carried out as fol- 
lows: 94° C 2 min; 30 cycles of 94°C 10s, 55°C 30 s, 72°C 30s; 68°C 
10 min. A modified colony PCR procedure was employed to confirm 
integration of recombination cassettes at the BUD31 locus 
rhttD://www.fhcrc.ora/labs/hahn/methods/mol bio meth/ocr veast c 
olonv.htmn . with the following amplification conditions: 95°C 5 min; 
35 cycles of 95°C 1 min, 55°C 1 min, 68°C 1 min; 72°C 10 min. Ana- 
lytical PCR to characterize Oxa inserts was carried out in 100 ixl re- 
action volumes containing approximately 0.5 ng genomic DNA pre- 
pared from candidate recombinants and control strains, 1.5 U Taq 
polymerase (Roche), 1 x reaction buffer, 0.2 mM each dNTP and 
100 ng each primer, with the same amplification conditions as for 
colony PCR, except that extension was carried out at 68°C for 2 min. 
All amplification reactions were performed with a Mastercycler gradi- 
ent 5331 (Eppendorf). 

For sequence analysis of recombinant Oxa inserts, preparatory PCR 
was carried out with the primer pairs KNS16/KNS29 (for Ura+CanR 
recombinants) or KNS11/KNS28 (for Trp+CyhR), followed by purifi- 
cation with the Qiaquick PCR kit (Qiagen). PCR products were se- 
quenced by Genome Express (Meylan, FR) with the primers KNS30, 
KNS31, KNS33 or KNS38, as appropriate. Recombinant sequences 
were aligned and analyzed using Clone Manager software (Sci Ed 
Central). Oligonucleotides used in PCR and sequencing (Table 3) 
were purchased from Proligo France. 



2. Results 
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2.1 Development of a veast meiotic homeolooous recombination 
system 

A strategy that makes use of the yeast Saccharomyces cerevisiae to 
promote in vivo recombination between diverged DNA sequences 
5 has been developed. Critical features of the strategy include the use 
of meiotic cells, in which high levels of genome-wide recombination 
take place, and inactivation of the mismatch repair (MMR) system, 
which normally restricts recombination between diverged sequences. 
Sequences to be recombined, i.e. the recombination substrates, are 

10 introduced into one of two vectors that also bears flanking marker 
sequences, so as to create recombination cassettes. The recombi- 
nation cassettes are introduced into the yeast genome, at a locus on 
chromosome III (the BUD31-HCM1 interval), which is in a region 
known to be recombinationally active in meiosis. Diploids heterozy- 

15 gous for recombination cassettes are sporulated, and spores are 
plated on media that select for cells with specific configurations of 
flanking markers, thereby allowing for the selection of recombinants 
in which a crossover involving recombination substrates has taken 
place (Figure 1). 

20 Two general recombination cassette vectors were constructed, 
pMXY9 and pMXY12, which contain the URA3 and CAN1, and the 
TRP1 and CYH2 markers, respectively, flanking restriction sites that 
can be used for the introduction of recombination substrates (Figure 
2). The URA3 marker confers uracil prototrophy, and the CAN1 

25 marker confers canavanine sensitivity. In the absence of this marker, 
cells are resistant to the drug. The TRP1 marker confers tryptophan 
prototrophy and the CYH2 marker confers cycloheximide sensitivity. 
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In the absence of this marker, cells are resistant to the drug. Each of 
the two recombination cassettes is in turn flanked by sequences that 
allow targeting of the entire insert to the BUD31-HCM1 locus by 
transformation of competent cells (Figure 2). A strain that serves as 
5 a primary host for transformation, MXY47, was also constructed (Ta- 
ble 2). This diploid is heterozygous for the msh2::KanMX mutation, 
and is phenotypically wild type with respect to MMR. It is also homo- 
zygous for the ura3-1, trp1-1, can1-100 and cyh2R markers, which 
allows the presence of recombination cassette markers to be moni- 
1 0 tared, and heterozygous for the his3-1 1,15 and leu2-3, 112 markers. 
MXY47 is transformed with fragments bearing recombination cas- 
settes, primary transformants are selected as Ura+ or Trp+ proto- 
trophs (for MXY9 and MXY12 derivatives, respectively), and target- 
ing is confirmed by analytic PCR using primers that recognize se- 
15 quences within and external to the introduced construct. Primary 
transformants are sporulated, and tetrads are dissected and replica 
plated to identify wild type or msh2 segregants that bear the recom- 
bination cassette. Suitable haploids are mated to one another to 
generate MSH2/MSH2 (wild type) and msh2/msh2 diploids het- 
20 erozygous for recombination substrates. In a first round of meiotic 
recombination to generate recombinants, these diploids are 
sporulated, and free spores are plated on media lacking uracil and 
containing canavanine to select for recombinants with the URA3- 
CYH2 configuration of flanking markers (Ura+CanR spore colonies), 
25 or on media lacking tryptophan arid containing cycloheximide to se- 
lect for the TRP1-CAN1 configuration (Trp+CyhR spore colonies). 
Parental diploids and non-recombinant haploid progeny cannot grow 
on these media. The frequency of spore colonies arising on selective 
media is determined, candidate recombinant spore colonies are 
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characterized phenotypically by replica plating on test media and 
molecularly by PCR with appropriate primer pairs, and a sample of 
confirmed recombinants is selected for sequencing. 

The strategy is iterative in that cells bearing recombinant inserts can 
5 be identified and subjected to further rounds of meiotic recombina- 
tion to increase diversity. In a second round, Ura+CanR and 
Trp+CyhR haploids are mated, and the sporulation and selection 
process is repeated, except that new recombinants are selected on 
media lacking uracil and containing cycloheximide, to select for re- 

10 combinants with the URA3-CAN1 configuration of flanking markers 
(Ura+CyhR spore colonies), or on media lacking tryptophan and con- 
taining canavanine to select for the TRP1-CYH2 configuration 
(Trp+CanR spore colonies). The strategy detailed here can also be 
modified to include additional markers to increase the stringency of 

15 selection. Furthermore, recombinants can also be directly selected 
by PCR using primers specific to flanking sequences. 

2.2 Phenotvoic selection for recombination between Oxa gene 

pairs of varying sequence divergence 

Genes belonging to the Oxa superfamily of beta-lactamases were 
20 chosen as substrates to test the feasibility of the system for the se- 
lection of recombinants. Recombination between the following Oxa 
pairs was assessed in the wild type and msh2 backgrounds: Oxa1 1- 
Oxa1 1, which share 100% homology throughout the 800 bp ORF; 
Oxa7-Oxa11, 95%; Oxa5-Oxa11, 78%. Diploids generated by 
25 crosses between appropriate haploids were induced to enter meio- 
sis. Spores were prepared from meiotic cultures, and serial dilutions 
were plated on YPD to determine cell viability and on medium lack- 
ing uracil and containing canavanine (-Ura+Can) and on medium 
lacking tryptophan and containing cycloheximide (-Trp+Cyh) to se- 
30 lect for recombinants. 
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2.3 Frequencies of recombina tion between Oxa genes of varying 
sequence homology 

The data shown in Figure 3 demonstrate that in the wild type back- 
ground, increased sequence heterology has a strong inhibitory effect 

5 on crossover recombination, and that this effect is relieved but not 
abolished by the msh2 mutation. In general, the msh2 mutation 
causes an increase in the frequency of recombination of about one 
order of magnitude above that observed for wild type strains at the 
two levels of. divergence tested. However, inactivation of MSH2 

10 alone does not fully compensate for the inhibition of recombination 
between recombination substrates with higher degrees of heterol- 
ogy. For example, the frequencies of recombination for a msh2 strain 
with Oxa inserts sharing 78% homology (MXY102) are at least 10- 
fold (Ura+CanR) and 25-fold (Trp+CyhR) below those found for a 

15 wild type strain with Oxa inserts of 100% homology (MXY60), indi- 
cating that factors other than MSH2-dependent mismatch repair pre- 
vent crossover recombination between more diverged sequences. It 
is noteworthy that the appearance of msh2 recombinants at the 78% 
divergence level, at frequencies of roughly 2 x 10" 4 , indicates that 

20 recombination may be achieved between even more divergent sub- 
strates. 

2.4 The msh2 hvper-recombination effect 

The effect of msh2 on homologous and homeologous recombination 
25 was quantified by first calculating the ratio of msh2 to wild type 
recombinants for a given percent of homology for a given selection 
for each experiment, and then calculating the means and standard 
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deviations of the ensemble of ratios thus determined. The data are 
shown in in Figure 4. The presence of the msh2 mutation increases 
the frequency of homeologous recombination for sequences of 95% 
and 78% homology, and there is a less pronounced but still quantifi- 
5 able enhancement of recombination between 100% identical se- 
quences. Furthermore, the extent of the msh2 enhancement of ho- 
meologous recombination differs for the two selections: for strains 
with homeologous Oxa inserts, inactivation of MSH2 increases the 
frequency of Trp+CyhR recombinants to a greater extent than it in- 

10 creases the frequency of Ura+CanR recombinants. In principle, the 
frequencies of both types of recombinants (Ura+CanR and 
Trp+CyhR) should be equivalent, but these numbers indicate that 
there are biases in the system that are provoked or enhanced by the 
msh2 mutation, in conjunction with variations in the extent of se- 

15 quence divergence. Experiments to test the relative influences of 
inserts and flanking marker sequences on the types of recombinants 
obtained indicate that this bias is a property of the flanking markers 
but the influence of the recombination substrates in directing the out- 
comes of meiotic recombination events cannot yet be accounted for 

20 (data not shown). 

2.5 PGR analysis of selected recombinants 

An example of PCR analysis, as applied to Ura+CanR and 
Trp+CyhR spore colonies derived from wild type and msh2 diploids 
25 containing Oxa genes of 22% divergence (MXY99 and MXY102, 
respectively) is shown in Figure 5. For each strain, ten spore colo- 
nies that exhibited each recombinant phenotype were analyzed. 
Extracts from each colony were used as templates for amplification 
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with primer pairs that specifically amplify parental molecules and 
with primer pairs that specifically amplify recombinant molecules. 
In every case(s), only the predicted recombinant insert was ampli- 
fied, indicating that the selected spore colonies contained se- 

5 quences produced by recombination between the parental Oxa re- 
combination substrates. These results also demonstrate that recom- 
binant molecules can be directly recovered from sporulated cultures, 
even without the imposition of a genetic selection step. Here, the 
primer recognition sites located in the URA3, CAN1, TRP1 and 

10 CYH2 genes represent molecular marker sequences that flank each 
recombination substrate. 

2.6 Sequence analysis of first round nrteiotic hom a° loaous recom- 
binants: 5% a nd 22% divergence 

Oxa7-Oxa11 meiotic recombinants derived from wild type and msh2 
15 diploids (MXY64 and MXY66, respectively), which contain recombi- 
nation substrates sharing 95% homology, that satisfied phenotypic 
and molecular (PCR) tests were subjected to sequence analysis. 
Recombinant fragments were amplified with primers specific to flank- 
ing markers and sequenced using primers close to the translational 
20 start and stop sites. Overall, 55 recombinant sequences derived from 
haploid progeny of Oxa7-Oxa 11 diploids were analyzed: 14 
Ura+CanR and 13 Trp+CyhR recombinants from MXY64, and 14 
Ura+CanR and 14 Trp+CyhR recombinants from MXY66. The se- 
quenced sample size allows several observations to be made. 1 ) For 
25 both wild type and msh2 recombinants, the position at which the 
crossover took place ranged throughout the full coding region, with 
no apparent preference for a specific interval. Crossovers that oc- 
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curred in the 5' region were as likely as those in the 3' region. Also, 
for a given strain, there was no apparent difference in the distribu- 
tions of crossover sites for spore colonies obtained by -Ura+Can or 
by -Trp+Cyh selection. 2) The length of uninterrupted homology in 
5 the crossover interval was also unimportant: crossovers were de- 
tected between two closely spaced polymorphisms (positions 543- 
552 for MXY66 Trp+CyhR #7, #8, and #13, where position 1 repre- 
sents the adenosine residue of the ATG translation^ start site) as 
well as between the two most widely spaced polymorphisms (posi- 

10 tions 163-265, eg MXY66 Trp+CyhR #15). 3) The recombinant Oxa 
inserts isolated from both wild type and msh2 backgrounds con- 
tained full-length recombinant sequences potentially capable of en- 
coding new, functional Oxa proteins. That is, all crossovers occurred 
in such a manner as to preserve an intact ORF, without a net inser- 

15 tion or deletion of nucleotides in the crossover interval or in any 
other interval. 4) Although the structures of most recombinant se- 
quences are consistent with a simple crossover between the two 
Oxa sequences in local regions of homology, several recombinants 
isolated in the msh2 background exhibited greater complexity. Se- 

20 quences derived from four recombinants (MXY66 Ura+CanR #16 
and #31, and MXY66 Trp+CyhR #5 and #9) exhibited a higher de- 
gree of mosaicism, as if they were produced by more than one 
crossover event. Indeed, analysis of two of these recombinants was 
complicated because inspection of electropherograms revealed the 

25 presence of two overlapping peaks at multiple sites within the se- 
quenced region, each site corresponding to an Oxa 7-Oxa1 1 poly- 
morphism. This observation indicates that the population of mole- 
cules that was sequenced was heterogeneous, for which the most 
likely explanation is the presence of unrepaired or partially repaired 
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heteroduplex DNA present in msh2 recombinant spores. This 
interpretation is consistent with the known increased frequency of 
post-meiotic segregation (PMS) caused by the msh2 mutation. In 
these two cases, MXY66 Ura+CanR #16 and #31, one or more 

5 repaired sites was flanked by stretches of unrepaired heteroduplex, 
consistent with the unmasking of a short patch mismatch repair 
activity in the msh2 background, as suggested by Coic, Gluck and 
Fabre (EMBO J. 19:3408). Several other cases of PMS unassociated 
with short-patch mismatch repair were also observed for msh2 

10 recombinant sequences, indicating that this alternative mismatch 
repair system may not be highly efficient at correcting mismatches in 
heteroduplex DNA. Judging from sequence electropherograms, the 
extent of uncorrected heteroduplex varied, from a short region of 
about 50 nt to a region almost covering the entire ORF. No evidence 

15 for PMS or short-patch mismatch repair was found for wild type 
recombinant sequences. Overall, these findings suggest that the 
extent of diversity created is greater in msh2 meiosis than in wild 
type meiosis. 

Meiotic recombinants were also derived from wild type and msh2 
diploids that contain recombination substrates sharing 78% homol- 

20 ogy, (MXY99 and MXY102, respectively). In total, 24 recombinant 
sequences derived from recombinant progeny of Oxa5-Oxa 11 dip- 
loids were analyzed: five Ura+CanR and three Trp+CyhR recombi- 
nants from MXY99, and nine Ura+CanR and seven Trp+CyhR re- 
combinants from MXY102. Inspection of these sequences suggests 

25 several trends. 1) Recombinant Oxa sequences obtained In both 
wild type and msh2 strains by selection on -Trp+CyhR exhibited 
crossovers at different positions throughout the ORF, with perhaps a 
slight tendency towards the middle 250 bp region (nt 333-nt 573) of 
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overall shared homology. In contrast, recombinants obtained in both 
wild type and msh2 strains by selection on -Ura+CanR exhibited a 
pronounced bias in the positions of crossovers: in 3 of 5 wild type 
and 8 of 9 msh2 sequences, crossovers occurred within the last 80 
5 nt of the region of shared homology, ie, the last 10% of the ORF. 2) 
The intervals of absolute homology in which crossovers were identi- 
fied ranged from 11 to 20 nt for the -Trp+Cyh selection, indicating a 
preference for these relatively larger regions of sequence identity. In 
contrast, crossover intervals were shorter for the -Ura+Can selec- 

10 tion, ranging from 3 to 17 nt (13/14 of these involved intervals 13 nt 
and shorter). 3) As for recombination involving sequences sharing 
95% homology, the new sequences obtained also consisted of intact 
ORFs and potentially encode novel Oxa proteins. 4) No cases of 
PMS, as judged by inspection of electropherograms, were found for 

15 wild type recombinants, but very short patches of unrepaired hetero- 
duplex were found for a few msh2 recombinants, including 3 of the 7 
Trp+CyhR recombinants. These regions included at most 67 nt, 
shorter than some of the tracts observed for Oxa7-Oxa1 1 recombi- 
nants. In sum, these observations indicate that recombinant se- 

20 quences can be selected from input recombination substrates vary- 
ing by at least 22%, and that these sequences encode novel pro- 
teins. 

2.7 Sequence analysis of Oxa7-Oxa11 second-round recombi- 
nants 

25 The ability of the yeast system to increase sequence diversity in an 
iterative manner was tested by constructing diploids from Oxa7- 
Oxa11 recombinant haploids generated in a first round of meiosis 
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and subjecting these new diploids to a second round of meiosis. 
Among the sequenced Ura+CanR and Trp+CyhR progeny of MXY64 
and MXY66, pairs of appropriate recombinants with crossovers in 
the same interval were selected to construct new diploids in which 
5 the overall level of sequence homology was again 95%. Three wild 
type (MXY81 , MXY82 and MXY83) and three msh2 (MXY86, MXY87 
and MXY88) diploids were created. Control wild type and msh2 dip- 
loids containing only Oxa11 sequence inserts were also constructed 
from appropriate recombinant progeny of MXY60 and MXY62, yield- 
10 ing MXY90 and MXY92. These diploids were sporulated and spores 
were plated on medium lacking uracil and containing cycloheximide 
(-Ura+Cyh) and on medium lacking tryptophan and containing ca- 
navanine (-Trp+Can) to select for second-round recombinants. As 
shown in Figure 6, the frequencies of -Ura+CyhR and Trp+CanR 
15 spore colonies observed for all of these strains is consistent with the 
anti-recombination effect of the MSH2 gene. Both types of colonies 
were found among progeny of the wild type homozygote MXY90 at 
frequencies above 10" 3 , whereas these frequencies were decreased 
5- to 10-fold among progeny of wild type diploids with Oxa insert 
20 heterology (MXY81 , MXY82 and MXY83). Inactivation of the MSH2 
gene in diploids with diverged Oxa inserts (MXY 86, MXY 87, and 
MXY 88) led to a 2 to 6-fold increase in the frequency of Ura+CyhR 
and Trp+CanR spore colonies, similar to the levels seen for a msh2 
diploid bearing identical Oxa inserts (MXY92). Although the media 
25 used differ from those used for selection of first-round recombinants, 
the frequencies at which wild type and msh2 second-round recombi- 
nants were selected are comparable to those for first-round recom- 
binants. 
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Ura+CyhR and Trp+CanR spore colonies in both the wild type 
(MXY81 and MXY83) and msh2 (MXY86) backgrounds were se- 
lected for sequencing. In all, 14 wild type and 7 msh2 Oxa inserts 
were sequenced. In most cases, a crossover occurred in a novel in- 
5 terval during second round recombination, again without apparent 
bias with respect to position or interval size: crossovers involving 
different intervals were found throughout the Oxa ORF and they oc- 
curred in intervals as large as 101 nt and as small as 5 nt. In one 
case (a MXY83 Trp+CyhR haploid), a second round crossover oc- 

10 curred in the first round crossover interval, thereby restoring a full 
Oxa1 1 sequence. Recombinants recovered from msh2 diploids were 
more diverse than those recovered from wild type strains. For the 
msh2 diploid MXY86 several spore colonies exhibiting extensive 
PMS and sequence mosaicism were observed, consistent with the 

15 formation of long tracts of heteroduplex in the recombinational inter- 
mediate. Furthermore, some mismatches were repaired in the. hete- 
roduplex tract, again consistent with a short-patch mismatch repair 
activity. In sum, second-round recombination in the msh2 back- 
ground is as efficient as first-round recombination, both qualitatively, 

20 with respect to generating sequence diversity (eg., crossover interval 
distribution and incidence of PMS), and quantitatively, with respect 
to increasing the overall frequency of homeologous (5% divergence) 
recombination. 
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Claims 

1. Process for generating and detecting recombinant DNA se- 
quences in Saccharomyces cerevisiae comprising the steps of: 

a) generating first diploid S. cerevisiae cells bearing in a defined 
locus of their genome a first recombination cassette compris- 
ing a first DNA sequence to be recombined, which is flanked 
by at least a first and a second marker sequences, and in an 
allelic position a second recombination cassette comprising a 
second DNA sequence to be recombined, which is flanked by 
at least a third and a fourth marker sequences, 

b) inducing the sporulation of the first diploid cells obtained in a) 
and 

c) isolating haploid cells containing recombination cassettes in 
which first recombined DNA sequences are flanked by at least 
the first and fourth marker sequences, and haploid cells con- 
taining recombination cassettes in which second recombined 
DNA sequences are flanked by at least the second and the 
third marker sequences. 

2. Process according to claim 1 , comprising further the steps of: 

a) generating second diploid cells by mating haploid cells con- 
taining the first recombined DNA sequences obtained in 1c) 
with haploid cells containing second recombined DNA se- 
quences obtained in 1c), 
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b) inducing the speculation of the second diploid cells obtained 
in a) and 

c) isolating haploid cells containing recombination cassettes in 
which third recombined DNA sequences are flanked by at 

5 least the first and the second marker sequences, and haploid 

cells containing fourth recombination cassettes in which fourth 
recombined DNA sequences are flanked by at least the third 
and the fourth marker sequences. 

3, Process according to claim 1 or 2, wherein further recombined 
10 DNA sequences are generated by subjecting the haploid cells ob- 
tained in 2c) at least once to another cycle of mating with other hap- 
loid cells, inducing the sporulation of the diploid cells obtained and 
isolating haploid cells with recombined DNA sequences on the basis 
of the molecular linkage between two marker sequences. 

15 4. Process according to any one of claims 1 to 3, wherein the first 
diploid cell is generated by simultaneously or sequentially transform- 
ing a diploid S. cerevisiae cell with a DNA molecule containing the 
first recombination cassette and a DNA molecule containing the sec- 
ond recombination cassette and optionally allowing the integration of 

20 the two recombination cassettes into allelic positions of the S. cere- 
visiae genome. 



5. Process according to claim 4, wherein the DNA molecule compris- 
ing the first or the second recombination cassettes is a yeast artifi- 
cial chromosome (YAC). 
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6. Process according to claim 4, wherein the DNA molecule compris- 
ing the first or the second recombination cassettes is a cloning vehi- 
cle, whereby the respective two marker sequences are flanked by 
targeting sequences which are homologous to a defined locus of the 

5 S. cerevisiae genome. 

7. Process according to according to any one of claims 1 to 3, 
wherein the first diploid cell is generated by fusing a haploid S. cere- 
visiae cell bearing in a locus of its genome the first recombination 
cassette with a haploid S. cerevisiae cell bearing in an allelic posi- 

1 0 tion the second recombination cassette. 

8. Process according to according to any one of claims 1 to 3, 
wherein the first diploid cell is generated by mating a haploid S. cer- 
evisiae cell bearing in a locus of its genome the first recombination 
cassette with a haploid S. cerevisiae cell bearing in an allelic posi- 

1 5 tion the second recombination cassette. 

9 process according to claim 7 or 8, wherein haploid cells bearing 
the first or second recombination cassette are generated by: 

a) inserting the first DNA sequence to be recombined between 
the first and the second marker sequences located adjacently 

20 on a first cloning vehicle and inserting the second DNA se- 

quence to be recombined between the third and the fourth 
marker sequences located adjacently on a second cloning 
vehicle, whereby the respective two marker sequences are 
flanked by targeting sequences which are homologous to a 

25 defined locus of the S. cerevisiae genome, 
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b) excising from the cloning vehicles obtained in a) fragments 
bearing the first recombination cassette and the second re- 
combination cassette, respectively, whereby each of the cas- 
settes comprises the DNA sequence to be recombined 
5 flanked by the respective two marker sequences, and each 

cassette in turn is flanked by targeting sequences, 

transforming the fragments bearing the recombination cas- 
settes with flanking targeting sequences obtained in b) sepa- 
rately into S. cerevisiae diploid cells, whereby the targeting 
sequences direct the integration of the cassettes into that lo- 
cus to which they are homologous, in order to obtain diploid 
cells heterozygous for the first cassette, or the second cas- 
sette, 

d) inducing separately the sporulation of the heterozygous dip- 
1 5 loid cells obtained in c) and 

e) isolating haploid cells containing the first cassette and ex- 
pressing the first and second marker sequences and sepa- 
rately haploid cells containing the second cassette and ex- 
pressing the third and the fourth marker sequences. 

20 10. Process according to claim 9, wherein the first cloning vehicle is 
plasmid pMXY9 and the second cloning vehicle is plasmid pMXY12. 

1 1 . Process according to any one of claims 4-6, 9 or 10, wherein the 
diploid S. cerevisiae cells used for transformation are auxotrophic for 
at least two nutritional factors. 



c) 



10 
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12. Process according to claim 11, wherein the diploid cells are ho- 
mozygous for the ura3-1 allele and the trp1-1 allele, which render 
them auxotrophic for uracil and tryptophan, respectively. 

13. Process according to any one of claims 4-6 or 9-12, wherein the 
diploid cells used for transformation are resistant to at least two anti- 
biotics. 

14. Process according to claim 13, wherein the diploid cells are ho- 
mozygous for the can1-100 allele and the cyh2R allele, which render 
them resistant to canavanine and cyclohexlmlde, respectively. 

15. Process according to any one of claims 4-6 or 9-14, wherein dip- 
loid cells of the S. cerevisiae strain MXY47 are used for transforma- 
tion, which are homozygous for the alleles ura3-1 , trp1-1, can1-100 
and cyh2R and heterozygous for the msh2::KanMX mutation. 

16. Process according to any one of claims 1 to 15, wherein the S. 
cerevisiae cells have a functional mismatch repair system. 

17. Process according to any one of claims 1 to 15, wherein the S. 
cerevisiae cells are transiently or permanently deficient in the mis- 
match repair system. 

18. Process according to claim 17, wherein the transient or perma- 
nent deficiency of the mismatch repair system is due to an mutation 
and/or an inducible expression or repression of one or more genes 
involved in the mismatch repair system, a treatment with an agent 
that saturates the mismatch repair system and/or a treatment with an 
agent that globally impairs the mismatch repair. 



67 



19. Process according to any one of claims 1 to 18, wherein the first 
and the second recombination cassettes are integrated in the 
BUD31-HCM1 locus on chromosome III of the S. cerevisiae genome. 

20. Process according to any one of claims 1 to 19, wherein the first 
5 and the second DNA sequences to be recombined diverge by at 

least 1 nucleotide. 

21 . Process according to any one of claims 1 to 20, wherein the first 
and the second DNA sequences to be recombined are derived from 
organisms other than and including S. cerevisiae. 

10 22. Process according to any one of claims 1 to 21, wherein the first 
and the second DNA sequences to be recombined comprise one or 
more non-coding sequences and/or one or more protein-coding se- 
quences. 

23. Process according to any of claims 1 to 22, wherein the marker 
15 sequences are selected from the group consisting of nutritional 

markers, pigment markers, antibiotic resistance markers, antibiotic 
sensitivity markers, primer recognition sites, intron/exon boundaries, 
sequences encoding a particular subunit of an enzyme, promoter 
sequences, downstream regulated gene sequences and restriction 
20 enzyme sites. 

24. Process according to claim 23, wherein the first and third marker 
sequences are nutritional markers, the gene products of which can 
compensate an auxotrophy of a S. cerevisiae cell. 
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25. Process according to claim 24, wherein the first marker 
sequence is URA3, the gene product of which can confer uracil 
prototrophy to an uracil auxotrophic S. cerevisiae cell. 

26. Process according to claim 24, wherein the third marker 
sequence is TRP1, the gene product of which can confer tryptophan 
prototrophy to an tryptophan auxotrophic S. cerevisiae cell. 

27. Process according to claim 23, wherein the second and fourth 
marker sequences are antibiotic sensitivity markers, the gene prod- 
ucts of which can confer sensitivity to an antibiotic to a S. cerevisiae 
cell which is resistant to that antibiotic. 

28. Process according to claim 27, wherein the second marker se- 
quence is CAN1, the gene product of which can confer sensitivity to 
canavanine to a canavanine-resistant s. cerevisiae cell. 

29. Process according to claim 27, wherein the fourth marker 
sequence is CYH2, the gene product of which can confer sensitivity 
to cycloheximide to a cycloheximide-resistant S. cerevisiae cell. 

30. Process according to any one of claims 1 to 29, wherein haploid 
cells containing recombination cassettes with either first, second, 
third or fourth recombined DNA sequences are identified by PCR 
processes in order to detect the presence of the respective marker 
combination. 

31. Process according to any one of claims 1 to 29, wherein haploid 
cells containing recombination cassettes with either first, second, 
third or fourth recombined DNA sequences are identified by plating 
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the haploid cells on media that select for the molecular linkage on 
the same DNA molecule of the respective marker combination. 

32. Process according to claim 31, wherein haploid cells containing 
first recombined DNA sequences are plated on a medium that se- 

5 lects for molecular linkage on the same DNA molecule of the first 
and the fourth marker sequences. 

33. Process according to claim 31, wherein haploid cells containing 
second recombined DNA sequences are plated on a medium that 
selects for molecular linkage on the same DNA molecule of the sec- 

1 0 ond and the third marker sequences. 

34. Process according to claim 31 , wherein haploid cells containing 
third recombined DNA sequences are plated on a medium that se- 
lects for molecular linkage on the same DNA molecule of the first 
and the second marker sequences. 

15 35. Process according to claim 31, wherein haploid cells containing 
fourth recombined DNA sequences are plated on a medium that se- 
lects for molecular linkage on the same DNA molecule of the third 
and the fourth marker sequences. 

36. Piasmid pMXY9, comprising adjacently the URA3 marker gene 
20 and the CAN1 marker gene, whereby the two marker sequences 
flank a polylinker sequence for inserting a DNA sequence to be re- 
combined and whereby the two markers are flanked by targeting se- 
quences homologous to the BUD31-HCM1 locus on chromosome III 
of the S. cerevisiae genome. 
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37. Plasmid pMXY9 according to claim 36, wherein the polylinker 
sequence comprises restriction sites for the restriction enzymes 
Sma\, Xba\, Pad and Bg/ll. 

38. Plasmid pMXY12, comprising adjacently the TRP1 marker gene 
and the CYH2 marker gene, whereby the two marker sequences 
flank a polylinker sequence for inserting a DNA sequence to be re- 
combined and whereby the two markers are flanked by targeting se- 
quences homologous to the BUD31-HCM1 locus on chromosome III 
of the S. cerevisiae genome. 

39. Plasmid pMXY12 according to claim 38, wherein the polylinker 
sequence comprises restriction sites for the restriction enzymes 
Sma\, Spel and Pad. 

40. S. cerevisiae strain MXY47, characterized in that diploid cells 
thereof are homozygous for the alleles ura3-1, trp1-1, can1-100 and 
cyh2R and heterozygous for the msh2::KanMX mutation. 

41. E. coli strain JM101, containing plasmid pMXY9. 

42. E. coli strain DH5a, containing plasmid pMXY12. 

43. Kit comprising at least a first container which comprises cells of 
S. cerevisiae strain MXY47, a second container which comprises 
cells of E. coli strain JM1 01 containing plasmid pMXY9 and a third 
container comprising cells of E. coli strain DH6a containing plasmid 
pMXY12. 

44. Kit comprising at least a first container comprising cells of S. 
cerevisiae strain MXY47, a second container comprising DNA of 
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plasmid pMXY9 and a third container comprising DNA of plasmid 
pMXY12. 
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Abstract 

Generation of recombinant genes in Saccharomyces cerevisiae 

The present invention relates to methods for generating and detect- 
ing recombinant DNA sequences in Saccharomyces cerevisiae and 
plasmids and Saccharomyces cerevisiae cells used for conducting 
the inventive methods. 
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SEQUENCE LISTING 

<110> Mixis France S.A. 

<120> Generation of recombinant genes in Saccharomyces 
cerevisiae 

<130> 18248 

<140> 
<141> 

<160> 33 

<170> Patentln Ver, 2.1 

<210> 1 
<211> 20 
<212> DNA 

<213> Saccharomyces cerevisiae 
<400> 1 

agaactccag cactccgtat 



<210> 2 
<211> 20 
<212> DNA 

<213> Saccharomyces cerevisiae 
<400> 2 

gagacctatc gccatcttca 



<210> 3 
<211> 20 
<212> DNA 

<213> Saccharomyces cerevisiae 
<400> 3 

tcggttctta ctgccaagtg 



<210> 4 
<211> 20 
<212> DNA 

<213> Saccharomyces cerevisiae 
<400> 4 

gagtggcaag aacttgtagc 



<210> 5 
<211> 20 
<212> DNA 

<213> Saccharomyces cerevisiae 
<400> 5 

catgacgatg aggacatcac 



<210> 6 
<2U> 20 
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<212> DNA 

<213> Saccharomyces cerevisiae 



<400> 6 

cgttcggacc taacatctct 



20 



<210> 7 
<211> 20 
<212> DNA 

<213> Saccharomyces cerevisiae 
<400> 7 

atgcttgatg gtcggaagag 20 



<210> 8 
<211> 20 
<212> DNA 

<213> Saccharomyces cerevisiae 



<210> 9 
<211> 20 
<212> DNA 

<213> Saccharomyces cerevisiae 
<400> 9 

aggtcgcctg acgcatatac 20 

<210> 10 
<211> 20 
<212> DNA 

<213> Saccharomyces cerevisiae 



<210> 11 
<211> 20 
<212> DNA 

<213> Saccharomyces cerevisiae 
<400> 11 

ccacctagcg gatgactctt. 20 



<210> 12 
<211> 20 
<212> DNA 

<213> Saccharomyces cerevisiae 



<400> 8 

tatgcctctt ccgaccatca 



20 



<400> 10 

tgaagaggag gtcgactacg 



20 



<400> 12 

cgcctcgttc agaatgacac 



20 



<210> 13 
<211> 45 
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<212> DNA 

<213> saccharomyces cefevisiae 
<40-0> 13 

acctctagat cttaattaag ttaacttacg gtgccctttg atcct 



<210> 14 
<211> 31 
<212> DNA 

<213> Saccharomyces cerevisiae 
<400> 14 

caagcggccg cagcgcatgt gtccgatctt t 



<210> 15 
<211> 39 
<212> DNA 

<213> Saccharomyces cerevisiae 



<400> 15 

caaggtacca gcggccgctg tgctgtctgc gctgcattc 



<210> 16 
<211> 28 
<212> DNA 

<213> Saccharomyces cerevisiae 
<400> 16 

acactcgagg caccgtaagg gaggagaa 



<210> 17 
<211> 39 
<212> DNA 

<213> Saccharomyces cerevisiae 
<400> 17 

caaggtacca gcggccgctg tgctgtctgc gctgcattc 



<210> 18 
<211> 36 
<212> DNA 

<213> Saccharomyces cerevisiae 



<400> 18 • 

acctctagac tagtcgaccc aaggaggtgc catgaa 



<210> 19 
<211> 35 
<212> DNA 

<213> Saccharomyces cerevisiae 
<400> 19 

accttaatta agtcgacctg cagcgaattg ttagc 



<210> 20 
<211> 31 
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<212> DNA 



<213> Saccharomyces cerevisiae 



<400> 20 

accactagtc gacaccaaga aggtgccatg a 
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<210> 21 
<211> 35 
<212> DNA 

<213> Saccharomyces cerevisiae 
<400> 21 

accttaatta agtcgacggt cacgatgctg tactt 35 



<210> 22 
<211> 20 
<212> DNA 

<213> Saccharomyces cerevisiae 



<210> 23 
<211> 20 
<212> DNA 

<213> Saccharomyces cerevisiae 
<400> 23 

caccagcgct ctagatacat 20 



<210> 24 
<211> 20 
<212> DNA 

<213> Saccharomyces cerevisiae 



<210> 25 
<211> 20 
<212> DNA 

<213> Saccharomyces cerevisiae 
<400> 25 

gagtagcagc acgttcctta 20 



<210> 26 
<211> 41 
<212> DNA 

<213> Saccharomyces cerevisiae 



<400> 22 

tcattgcctt ctccactctc 



20 



<400> 24 

ggaccagaac tacctgtgaa 



20 



<400> 26 

ccaggatccc gggttaatta agttcccagg aaaccctttc g 



41 



<210> 27 
<211> 28 



61 



<212> DNA 

<213> Saccharomyces cerevisiae 
<400> 27 

caacgatcgg atctgccact ggtaactt 28 

<210> 28 
<211> 20 
<212> DNA 

<213> Saccharomyces cerevisiae 
<400> 28 

ttgtttgtgc acttgcctgc 20 



<210> 29 
<211> 20 
<212> DNA 

<213> Saccharomyces cerevisiae 
<400> 29 

aaagccgtat gactcaccca 20 



<210> 30 
<211> 20 
<212> DNA 

<213> Saccharomyces cerevisiae 
<400> 30 

gaaagggttt cctgggaact 20 



<210> 31 
<211> 37 
<212> DNA 

<213> Saccharomyces cerevisiae 
<400> 31 

cgcggatcca ctagtcgacc caccaaggta ccatgaa 37 



<210> 32 
<211> 35 
<212> DNA 

<213> Saccharomyces cerevisiae 
<400> 32 

accttaatta agtcgaccga gccttgagcg ccttg 35 



<210> 33 
<211> 20 
<212> DNA 

<213> saccharomyces cerevisiae 



<400> 33 

tgagaagatg cggccagcaa 



20 



